(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Oi^anization 

International Bureau 



llllllllllllllllllllllillllllllllllillllii 



(43) International Publication Date 
9 August 2001 (09.08.2001) 



(10) International Publication Number 

WO 01/57274 A2 



(21) International Application Number: PCT/USOl/00666 

(22) International FUing Date: 30 January 2001 (30.01.2001) 
(25) Filing Language: English 



(26) Publication Language: En 


glisli 


(30) Priority Data: 






60/180,312 


4 February 2000 (04.02.2000) 


US 


60/207,456 


26 May 2000 (26.05.2000) 


US 


09/608,408 


30 June 2000 (30.06.2000) 


us 


09/632,366 


3 August 2000 (03.08.2000) 


us 


60/234,687 




us 


60/236,359 


27 September 2000 (27.09.2000) 


us 


0024263.6 


4 October 2000 (04.10.2000) 


GB 



■ (71) AppWcmt (for all designated States except US): MOhEX:- 

j ULAR DYNAMICS, INC. [/US]; 928 East Arques Av- 

j enue, Sunnyvale, CA 94086 (US). 

I (72) Inventors; and 

i (75) Inventors/Applicants (for US only): PENN, Sharron, 

I G. [GBAIS]; 617 South Delaware Street, San Mateo, CA 

i 94402 (US). HANZEL, David, K. [US/US]; 968 Loma 

1 Verde Avenue, Palo Alto, CA 94303 (US). CHEN, Wen- 

j sheng [CN/US] ; 210 Easy Street #25, Mountain View, CA 



94043 (US). RANK, David, R. [USAJS]; 117 El Dorado 
Commons, Fremont, CA 94539 (US). 

(74) Agent: RONNING, Royal, N., Jr.; Amersham Pharmacia 
Biotech, Inc., 800 Centennial Avenue, PO. Box 1327, Pis- 
cataway, NJ 08855 (US). 

(81) Designated States (national): AU, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG. BR, BY, BZ, CA, CH, CN, OR, CU, CZ, 
DE, DK, DM, DZ, EE, ES, R, GB, GD, GE, GH, GM, HR, 
HU, ID, IL, IN, IS, IP, KE, KG, KP, KR, KZ, LC, LK, LR, 
I^, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, 
NO, NZ, PL, FT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, H, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, MI., MR, NE, SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations " appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



) (57) Abstract: A single exon nucleic acid microarray comprising a plurality of single exon nucleic acid probes for measuring gene 
► expression in a sample derived from liuman lieart is described. Also described are single exon nucleic acid probes expressed in the 
heart and their use in methods for delecting gene expression. 



wo 01/57274 PCT/USOl/00666 
HUMAN GENOME-DERIVED SINGLE EXON- NUCLEIC ACID PROBES USEFUL 
FOR ANALYSIS OF GENE EXPRESSION IN HUMAN HEART 

CROSS REFERENCE TO RELATED APPLICATIONS 

5 

The present application is a continuation-in-part of U.S. 
patent application serial nos . 0-9/632, 366, filed August 3, 
2000 and 09/608,408, filed June 30, 2000; claims the 
benefit under 35 U.S.C. s 119(e) of U . S . provisional patent 

10 application serial nos. 60/236,359, filed September 27, 
2000, 60/234,687, filed September 21, 2000, 60/207,456, 
filed May 26, 2000, and 60/180,312, filed February 4, 2000; 
and further claims the benefit under 35 U.S.C. s 119(a) of 
UK patent application no. 0024263.6, filed October 4, 2000, 

15 the disclosures of which are incorporated herein by 
reference in their entireties. 

REFERENCE TO SEQUENCE LISTING AND INCORPORATION BY 
REFERENCE THEREOF 

20 

The present application includes a Sequence Listing in 
electronic format, filed pursuant to PCT Administrative 
Instructions 801 - 806 on a single CD-R disc, in 
triplicate, containing a file named pto_HEART.txt, created 
25 24 January 2001, having 20,1.8 6, 94 6 bytes. The Sequence 

Listing contained In said file on said disc is incorporated 
herein by reference in its entirety. 

Field of the Invention 

30 

The present invention relates to genome-derived 
single exon microarrays useful for verifying the expression 
of regions of genomic DNA predicted to encode protein. In 
particular, the present invention relates to unique genome- 
35 derived single exon nucleic acid probes expressed in human 
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heart and single exon nucleic acid microarrays that include 
such probes. 



Background of the Invention 
5 For almost two decades following the invention of 

general techniques for nucleic acid sequencing, Sanger et 
ai., Proc. Natl. Acad. Sci. f/SA 70 (4) : 1209-13 (1973); 
Gilbert et al., Proc. Natl. Acad. Sci. USA 70 ( 12) : 3581-4 
(1973) , these techniques were used principally as tools to 

10 further the understanding of proteins — known or 

suspected — about which a basic foundation of biological 
knowledge had already been built. In many cases, the 
cloning effort that preceded sequence identification had 
been both informed and directed by that antecedent 

15 biological understanding. 

For example, the cloning of the T cell receptor 
for antigen was predicated upon its known or suspected cell 
type-specific expression, by its suspected membrane 
association, and by the predicted assembly of its gene via 

20 T cell-specific somatic recombination. Subsequent 
sequencing efforts at once confirmed and extended 
understanding of this family of proteins. Hedrick et al., 
JJature 308 (5955) : 153-8 (1984). 

More recently, however, the development of high 

25 throughput sequencing methods and devices, in concert with 
large public and private undertakings to sequence the human 
and other genomes, has altered this investigational 
paradigm: today, sequence information often precedes 
understanding of the basic biology of the encoded protein 

30 product. 

One of the approaches to large-scale sequencing 
is predicated upon the proposition that expressed 
sequences - that is, those accessible, through isolation of 
mRNA - are of greatest initial interest. This "expressed 
35 sequence tag" ("EST") approach has already yielded vast 

2 
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amounts of sequence data (see for example Adams et al., 
Science 252:1651 (1991); Williamson, Drug Discov. Today 
4:115 (1999)). For nucleic acids sequenced by this 
approach, often the only biological information that is 
5 known a priori with any certainty is the likelihood of 
biologic expression itself. By virtue of the species and 
tissue from which the mRNA had originally been obtained, 
most such sequences ar.e also annotated with the identity of 
the species and at least one tissue in which expression 

10 appears likely. 

More recently, the pace of genomic sequencing has 
accelerated dramatically. When genomic DNA serves as the 
initial substrate for sequencing efforts, expression cannot 
be presumed; often the only a priori biological information 

15 about the sequence , includes the species and chromosome (and 
perhaps chromosomal map location) of origin. 

With the ever-accelerating pace of sequence 
acciamulation by directed, EST, and genomic sequencing 
approaches — and in particular, with the accumulation of 

20 sequence information from multiple genera, from multiple 

species within genera, and from multiple individuals within 
a species — there is an increasing need for methods that 
rapidly and effectively permit the functions of nucleic 
sequences to be elucidated. And as such functional 

25 information accumulates, there is a further need for 
methods of storing such functional information in 
meaningful and useful relationship to the sequence itself; 
that is, there is an increasing need for means and 
apparatus for annotating raw sequence data with known or 

30 predicted functional information. 

Although the increase in the pace of genomic 
sequencing is due in large part to technological changes in 
sequencing strategies and instriamentation. Service, Science 
280:995 (1998); Pennisi, Science 283: 1822-1823 (1999), 

35 there is an important functional motivation as well. 

3 
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While it was understood that the EST approach 
would rarely be able to yield sequence information about 
the noncoding portions of the genome, it now also appears 
the EST approach is capable of capturing only a fraction of 
5 a genome's actual expression complexity. 

For example, when the C. elegans genome was fully 
sequenced, gene prediction algorithms identified over 
19,000 potential genes, of which only 7,000 had been found 
by EST sequencing. C. elegans Sequencing Consortium, 

10 Science 282:2012 (1998). Analogously, the recently 

completed sequence of chromosome 2 of Arabidopsis predicts 
over 4000 genes, Lin et ai., Nature, 402:761 (1999), of 
which only about 6% had previously been identified via EST 
sequencing efforts. Although the human genome has the 

15 greatest depth of EST coverage, it is still woefully short 
of surrendering all of its genes. One recent estimate 
suggests that the human genome contains more than 146,000 
genes., which would at this point leave greater than half of 
the genes undiscovered. It is now predicted that many 

20 genes, perhaps 20 to 50%, will only be found by genomic 
sequencing. 

There is, therefore, a need for methods that 
permit the functional regions of genomic sequence — and 
most importantly, but not exclusively, regions that 

25 function to encode genes - to be identified. 

Much of the coding sequence of the human genome 
is not homologous to known genes, making detection of open 
reading frames ("ORFs") and predictions of gene function 
difficult. Computational methods exist for predicting 

30 coding regions in eukaryotic genomes. Gene prediction 
programs such as GRAIL and GRAIL II, Uberbacher et al., 
Proc. Natl. Acad. Sci. USA 88 (24) : 11261-5 (1991); Xu et 
al.. Genet. Eng. 16:241-53 (1994); Uberbacher et al., 
Methods Enzymol. 266:259-81 (1996); GENEFINDER, Solovyev et 

35 al., Nucl. Acids. Res. 22:5156-63 (1994); Solovyev et al., 

4 
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Ismb 5:294-302 (1997); and GENESCAN, Burge et al., J. Mol. 
Biol. 268:78-94 (1997), predict many putative genes without 
known homology or function. Such programs are known, 
however, to give high false positive rates. Burset et al., 
5 Genomics 34:353-367 (1996). Using a consensus obtained by 
a plurality of such programs is known to increase the 
reliability of calling exons from genomic sequence. 
Ansari-Lari et al., Genome Res. 8(l):29-40 (1998) 

Identification of functional genes from genomic 

10 data remains, however, an imperfect art. For example, in 
reporting the full sequence of human chromosome 21, the 
Chromosome 21 Mapping and Sequencing Consortium reports 
that prior bioinf ormatic estimates of human gene number may 
need to be revised substantially downwards. Nature 

15 405:311-199 (2000); Reeves, Nature 405:283-284 (2000). 

Thus, there is a need for methods and apparatus 
that permit the functions of the regions identified 
bioinformatically - and specifically, that permit the 
expression of regions predicted to encode protein - readily 

20 to be confirmed experimentally. 

Recently, the development of nucleic acid 
microarrays has made possible the automated and highly 
parallel measurement of gene expression. Reviewed in 
Schena (ed. ) , DNA Microarrays : A Practical Approach 

25 (Practical Approach Series ), Oxford University Press (1999) 
(ISBN: 0199637768); Nature Genet. 21(l)(suppl):l - 60 
(1999); Schena (ed.), Mlcroarray Biochip: Tools and 
Technology , Eaton Publishing Company/BioTechniques Books 
Division (2000) (ISBN: 1881299376) . ■ 

30 It is common for microarrays to be derived from 

cDNA/EST libraries, either from those previously described 
in the literature, such as those from the I.M.A.G.E. 
consortium, Lennon et al., Genomics 33(1): 151-2 (1996), or 
from the construction of "problem specific" libraries 

35 targeted at a particular biological question, R.S. Thomas 
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efc al., Cancer Res. (in press). Such microarrays by 
definition can measure expression only of those genes found 
in EST libraries, and thus have not been useful as probes 
for genes discovered solely by genomic sequencing. 
5 The utility of using whole genome nucleic acid 

microarrays to answer certain biological questions has been 
demonstrated for the yeast Saccharomyces cerevisiae . De 
Risi et al., Science 278:680 (1997). The vast majority of 
yeast nuclear genes, approximately 95% however, are single 

10 exon genes, i.e., lack introns, Lopez efc al., RNA 5:1135- 
1137 (1999); Goffeau et al., Science 274:563-67 (1996), 
permitting coding regions more readily to be identified. 
Whole genome nucleic acid microarrays have not generally 
been used to probe gene expression from more complex 

15 eukaryotic genomes, and in particular from those averaging 
more than one intron per gene. 

Diseases of the heart and vascular system are a 
significant cause of human morbidity and mortality. 
Increasingly, genetic factors are being found that 

20 contribute to predisposition, onset, and/or aggressiveness 
of most, if not all, of these diseases. Although mutations 
in single genes have on occasion been identified as 
causative, these disorders are for the most part believed 
to have polygenic etiologies. There is a need for methods 

25 and apparatus that permit prediction, diagnosis and 
prognosis of diseases of the human heart, particularly 
those diseases with polygenic etiology. 

Summary of the Invention 

30 

The present invention solves these and other 
problems in the art by providing methods and apparatus for 
predicting, confirming, and displaying functional 
information derived from genomic sequence. The present 
35 invention also provides apparatus for verifying the 
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expression of putative genes identified within genomic 
sequence . 

In particular, the invention provides novel 
genome-derived single exon nucleic acid microarrays useful 
5 for verifying the expression of putative genes identified 
within genomic sequence. 

The present invention also provides compositions 
and kits for the ready production of nucleic acids 
identical in sequence to, or substantially identical in ' 
10 sequence to, probes on the genome-derived single exon 
microarrays of the present invention. 

Accordingly, in a first aspect of the invention, 
there is provided a spatially-addressable set of single 
exon nucleic acid probes for measuring .gene expression in a 
15 sample derived from human heart, comprising a plurality of 
single exon nucleic acid probes according to any one of the 
nucleotide sequences set out in SEQ ID NOs : 1 - 9,98 0 or a 
complementary sequence, or a portion of such a sequence. 

By plurality is meant at least two, suitably at 
20 least 20, most suitably at least 100, preferably at least 
1000 and, most preferably, upto 5000. 

In one embodiment of the first aspect, each of 
said plurality of probes is separately and addressably 
amplif iable. 

25 In an alternative embodiment, each of said 

plurality of probes is separately and addressably 

isolatable from said plurality. 

In a preferred embodiment, each of said plurality 

of probes is amplif iable using at least one common primer. 
30 Preferably, each of said plurality of probes is amplifiable 

using a first and a second common primer. 

In yet another embodiment, said set of single 

exon nucleic acid probes comprises between 50 - 20,000 

probes, for example, 50 - 5000. 
35 Suitably, said set of single exon nucleic acid 

7 



wo 01/57274 PCT/USOl/00666 

probes comprises at least 50 - 1000 discrete single exon 
nucleic acid probes having a sequence as set out in any of 
SEQ ID NOS.: 1 - 19,771 or a complimentary sequence, or a 
portion of such a sequence. 
5 Preferably, the average length of the single exon 

nucleic acid probes is between 200 and 500 bp. It is 
preferred that the average length should be at least 200bp, 
suitably at least 250bp, most suitably at least 300bp, 
preferably at least 400bp and, most preferably, 500 bp. 

10 In another embodiment, the single exon nucleic 

acid probes lack prokaryotic and bacteriophage vector 
sequence. It is preferred that at least 50%, suitably at 
least 60%, most suitably at least 70%, preferably at least 
75%, more preferably at least 80, 85, 90, 95 or 99% of said 

15 single exon nucleic acid probes lack prokaryotic and 
bacteriophage vector sequence. 

In another preferred embodiment, said single exon 
nucleic acid lack homopolymeric stretches of A or T. It is 
preferred that at least 50%, suitably at least 60%, most 

20 suitably at least 70%, preferably at least 75%, more 

preferably at least 80, 85, 90, 95 or. 99% of said single 
exon nucleic acid probes lack homopolymeric stretches of A 
' or T. 

Preferably, a spatially-addressable set of single 
25 exon nucleic acid probes in accordance with the first 

aspect of the invention is is addressably disposed upon a 
substrate . 

Suitable substrates include a filter membrane 
which may, preferably, be nitrocellulose or nylon. The 

30 nylon may preferably, be positively-charged. Other suitable 
substrates include glass, amorphous silicon, crystalline 
silicon, and plastic. Further suitable materials include 
polymethylacrylic, polyethylene, polypropylene, 
polyacrylate, polymethylmethacrylate, polyvinylchloride, 

35 polytetraf luoroethylene, polystyrene, polycarbonate, 

8 
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polyacetal, polysulfone, celluloseacetate, 
cellulosenitrate, nitrocellulose, and mixtures thereof. 

In a second aspect of the invention, there is 
provided a microarray comprising a spatially addressable 
5 set of single exon nucleic acid probes in accordance with 
the first aspect of the invention. 

In one embodiment, a genome-derived single-exon 
microarray is packaged together with such an ordered set of 
amplifiable probes corresponding to the probes, or one or 
10 more subsets of probes, thereon. In alternative 

embodiments, the ordered set of amplifiable probes is 
packaged separately from the genome-derived single exon 
microarray. 

In another aspect, the invention provides genome- 

15 derived single exon nucleic acid probes useful for gene 
expression analysis, and particularly for gene expression 
analysis by microarray. In particular embodiments of this 
aspect, the present inventio- provides human single-exon 
probes that include specif ically-hybridizable fragments of 

20 SEQ ID Nos. 9,981 - 19,771, wherein the fragment hybridizes 
at high stringency to an expressed human gene. In 
particular embodiments, the invention provides single exon 
probes comprising SEQ ID Nos. 1 - 9,980. 

Accordingly, in a third aspect of the invention, 

25 there is provided a single exon nucleic acid probe for 
measuring human gene expression in a sample derived from 
human heart which is a nucleic acid molecule comprising a 
nucleotide sequence as set out in any of SEQ ID NOs.: 1 - 
9,980 or a complementary sequence or a fragment thereof 

30 wherein said probe hybridizes at high stringency to a 
nucleic acid expressed in the human heart. 

In one embodiment, a single exon nucleic acid 
probe in accordance with the third aspect comprises a 
nucleotide sequence as set out in any of SEQ ID NOs.: 9,981 

35 - 19,771 or a complementary sequence or a fragment thereof. 

9 
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In a fourth aspect of the invention, there is 
provided a single exon nucleic acid probe for measuring 
human gene expression in a sample derived from human heart 
which is a nucleic acid molecule having a sequence encoding 

5 a peptide comprising a peptide sequence as set out in any 
of SEQ ID NOs.: 19,772 - 29,119 or a complementary sequence 
or a fragment thereof wherein said probe hybridizes at high 
stringency to a nucleic acid expressed in the human heart. 

Preferably, a single exon nucleic acid probe in 

10 accordance with the third or fourth aspects of the 

invention comprises between at least 15 and 50 contiguous 
nucleotides of said SEQ ID NO: . It is preferred that the 
single exon nucleic acid probe comprises at least 15, 
suitably at least 20, more suitably at least 25 or 

15 preferably at least 50 contiguous nucleotides of said SEQ 
ID NO: . 

In another preferred embodiment, a single exon 
nucleic acid probe . in accordance with the third or fourth 
aspects of the invention is between 3k.b and 25kb in length. 

20 It is preferred that said probe is no more than 3kb, 

suitably no more than Skb, more suitably no more than lOkb, 
preferably 15kb, more preferably 20kb or, most preferably, 
no more than 20kb in length. 

Preferably, a single exon nucleic acid probe in 

25 accordance with either the fifth or sixth aspect of the 
invention is DNA, preferably single-stranded DNA, RNA or 
PNA. 

In another embodiment of either the third or 
fourth aspect of the invention, a single exon nucleic acid 

30 probe is detectably labeled. Suitable detectable labels 
include a radionuclide, a fluorescent label or a first 
member of a specific binding pair. Suitable fluorescent 
labels include dyes such as cyanine dyes, preferably Cy3 
and CyS although other suitable dyes will be known to those 

35 skilled in the art. 
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In a particularly preferred embodiment, a single 
exon nucleic acid probe in accordance with either the third 
or fourth aspect of the invention lacks prokaryotic and 
bacteriophage vector sequence. In yet another embodiment, a 
5 single exon nucleic acid probe in accordance with either 
the third or fourth aspect of the invention lacks 
homopolymeric stretches of A or T. 

In a fifth aspect of the invention, there is 
provided an amplifiable nucleic acid composition, 
10 comprising: 

the single exon nucleic acid probe in accordance 
with either of the third or fourth aspects of the 
invention; and at least one nucleic acid primer; 

wherein said at least one primer is sufficient to 
15 prime enzymatic amplification of said probe. 

In an sixth aspect of the invention, there is 
provided a method of measuring gene expression in a sample 
derived from human heart, comprising: 

contacting the single exon microarray in 
20 accordance with the second aspect of the invention, with a 
first collection of detectably labeled nucleic acids, said 
first collection of nucleic acids derived from mRNA of 
human heart; and then 

measuring the label detectably bound to each 
25 probe of said microarray. 

In a seventh aspect of the invention, there is 
provided a method of identifying exons in a eukaryotic 
genome, comprising: 

algorithmically predicting at least one exon from 
30 genomic sequence of said eukaryote; and then 

detecting specific hybridization of detectably 
labeled nucleic acids to a single exon probe, 

wherein said detectably labeled nucleic acids are 
derived from mRNA from the heart of said eukaryote, said 
35 probe is a single exon probe having a fragment identical in 
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sequence to, or complementary in sequence to, said 
predicted exon, said probe is included within a single exon' 
microarray in accordance with the first aspect of the 
invention, and said fragment is selectively hybridizable at 
5 high stringency. 

In a eighth aspect of the invention, there is 
provided a method of assigning exons to a single gene, 
comprising: 

identifying a plurality of exons from genomic 
10 sequence in accordance with the seventh aspect of the 
invention; and then 

measuring the expression of each of said exons in 
a plurality of tissues and/or cell types using 
hybridization to single exon microarrays having a probe 
15 with said exon, 

wherein a common pattern of expression of said 
exons in said plurality of tissues and/or cell types 
indicates that the exons should be assigned to a single 
gene. 

20 In an ninth aspect of the invention, there is 

■ provided a nucleic acid sequence as set out in any of SEQ 
ID NOs: 1 - 19,771 wherein said sequence encodes a peptide. 

In a tenth aspect of the invention, there is 
provided a peptide encoded by a sequence comprising a 
25 sequence as set out in any of SEQ ID NOs: 9,981 - 19,771, 
or a complementary sequence or coding portion thereof. 

In a preferred embodiment, a peptide may be 
encoded by a sequence comprising a sequence set out in any 
of SEQ ID NOS. : 1 - 9, 980. 
30 In a further aspect, the , invention provides 

peptides comprising an amino acid sequence translated from 
the DNA fragments, said amino acid sequences comprising SEQ 
ID NOS. : 9, 981 - 19,771. 

Accordingly in a eleventh aspect of the invention 
35 there is provided a peptide comprising a sequence as set 



12 



wo 01/57274 PCT/USOl/00666 

out in any 'of SEQ ID NOs : 19,772 - 29, 119, or fragment 
thereof . 

In another aspect, the invention provides means 
for displaying annotated sequence, and in particular, for 
5 displaying sequence annotated according to the methods and 
apparatus of the present invention. Further, such display 
can be used as a preferred graphical user interface for 
electronic search, query, and analysis of such annotated 
sequence . 



Detailed Description of the Invention 



Definitions 

15 As used herein, the term "microarray" and phrase 

"nucleic acid microarray" refer to. a substrate-bound 
collection of plural nucleic acids, hybridization to each 
of the plurality of bound nucleic acids being separately 
detectable. The substrate can be solid or porous, planar 

20 or non-planar, unitary or distributed. 

As so defined, the term "microarray" and phrase 
"nucleic acid microarray" include all the devices so called 
in Schena (ed.), DNA Microarrays ; A Practical Approach 
(Practical Approach Series ), Oxford University Press (1999) 

25 (ISBN: 0199637768); Nature Genet. 21(l)(suppl):l - 60 
(1999); and Schena (ed.), Microarray Biochip: Tools and 
Technology , Eaton Publishing Company/BioTechniques Books 
Division (2000) (ISBN: 1881299376). As so defined, the 
term "microarray" and phrase "nucleic acid microarray" 

30 further include substrate-bound collections of plural 

nucleic acids in which the nucleic acids are distributably 
disposed on a plurality of beads, rather than on a unitary 
planar substrate, as is described, inter alia, in Brenner 
et al., Proc. Natl. Acad. Sci. USA 97 ( 4 ): 166501670 (2000); 

35 in such case, the term "microarray" and phrase "nucleic 

13 
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acid microarray" refer to the plurality of beads in 
aggregate . 

As used herein with respect to a nucleic acid 
microarray, the term "probe" refers to the nucleic acid 
5 that is, or is intended to be, bound to the substrate; in 
such context, the term "target" thus refers to nucleic acid 
intended to be bound thereto by Watson-Crick 
complementarity. As used herein with respect to solution 
phase hybridization, the term "probe" refers to the nucleic 
10 acid of known sequence that is detectably labeled. 

As used herein, the expression "probe comprising 
SEQ ID NO.", and variants thereof, intends a nucleic acid 
probe, at least a portion of which probe has either (i) the 
sequence directly as given in the referenced SEQ ID NO,, or 
15 (ii) a sequence complementary to the sequence as given in 
the referenced SEQ ID NO., the choice as between sequence 
directly as given and complement thereof dictated by the 
requirement that the probe hybridize to mRNA. 

As used herein, the term "open reading frame" and 
20 the equivalent acronym "ORF" refer to that portion of an 
exon that can be translated in its entirety into a sequence 
of contiguous amino acids i.e. a nucleic acid sequence 
that, in at least one reading frame, does not possess stop 
codons; the term does not require that the ORF encode the 
25 entirety of a natural protein. 

As used herein, the term "'''amplicon" refers to a 
PGR product amplified from human genomic DNA, containing 
the predicted exon. 

As used herein the term ''^exon" refers to the 
30 consensus prediction of the various exon and gene 
predicting algorithms i.e. a nucleic acid sequence 
bioinformatically predicted to encode a portion of a 
natural protein. 

As used herein, the term "peptide" refers to a 
35 sequence of amino acids. The sequences referred to as 

14 
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PEPTIDE SEQ ID NOS . : are the predicted peptide sequences 
that would be translated from one of the exons, or a 
portion thereof set out in axon SEQ ID NOS.:. The codons 
encoding the peptide are wholly contained within the axon. 
5 As used herein, a portions" of a defined 

nucleotide sequence or sequences can be and, preferably, 
are fragments unique to that sequence or to one or a 
combination of those sequences. A fragment unique to a 
nucleic acid molecule is one that is a signature for the 

10 larger nucleic acid molecule. 

As used herein, the phrase "expression of a 
probe" and its linguistic variants means that the ORF 
present within the probe, or its complement, is present 
within a target mRNA. 

15 As used herein, "stringent conditions" refers to 

parameters well known to those skilled in the art. When a 
nucleic acid molecule is said to be hybridisable to another 
of a given sequence under "stringent conditions" it is 
meant that it is homologous to the given sequence. 

20 As used herein, the phrase "specific binding 

pair" intends a pair of molecules that bind to one another 
with high specificity. Binding pairs are said to exhibit 
specific binding when they exhibit avidity of at least lO'^, 
preferably at least 10^, more preferably at least 10^ 

25 liters/mole. Nonlimiting examples of specific binding 
pairs are: antibody and antigen; biotin and avidin; and 
biotin and streptavidin. 

As used herein with respect to the visual display 
of annotated genomic sequence, the term "rectangle" means 

30 any geometric shape that has at least a first and a second 
border, wherein the first and second borders each are 
capable of mapping uniquely to a point of another visual 
object of the display. 

As used herein, a "Mondrian" means a visual 

35 display in which a single genomic sequence is annotated 
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with predicted and experimentally confirmed functional 
information. 



5 Brief Description of the Drawings 

The present invention is further illustrated with 
reference to the following non-limiting figures and 
examples in which: 

10 FIG. 1 illustrates a process for predicting 

functional regions from genomic sequence, confirming the 
functional activity of such regions experimentally, and 
associating and displaying the data so obtained in 
meaningful and useful relationship to the original sequence 

15 data; 

FIG. 2 further elaborates that portion of the 
process schematized in FIG. 1 for predicting functional 
regions from genomic sequence; 

FIG. 3 illustrates a Mondrian visual display; 
20 FIG. 4 presents a Mondrian showing a hypothetical 

annotated genomic sequence; 

FIG. 5 is a histogram showing the distribution of 
ORF length and PGR products as obtained, with ORF length 
shown in black and PGR product length shown in dotted 
25 lines; 

FIG. 6 is a histogram showing the distribution, 
among exons predicted according to the methods described, 
of expression as measured using simultaneous two color 
hybridization to a genome-derived single exon microarray. 

30 The graph shows the number of sequence-verified products 
that were either not expressed ("0"), expressed in one or 
more but not all tested tissues ("1" - "9"), or expressed 
In all tissues tested ("10"); 

FIG. 7 is a pictorial representation of the 

35 expression of verified sequences that showed expression 
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with signal intensity greater than 3 in at least one 
tissue, with: FIG. 7A showing the expression as measured by 
microarray hybridization in each of the 10 measured 
tissues, and the expression as measured "bioinf ormatically" 
5 by query of EST, NR and SwissProt databases; with FIG. 7B 
showing the legend for display of physical expression 
(ratio) in FIG. 7A; and with FIG. 7C showing the legend for 
scoring EST hits as depicted in FIG. 7A; 

FIG. 8 shows a comparison of normalized CY3 

10 signal intensity for arrayed sequences that were identical 
to sequences in existing EST, NR and SwissProt databases or 
that were dissimilar (unknown) , where black denotes the 
signal intensity for all sequence-verified products with a 
BLAST Expect ("E") value of greater than le-30 (1 x 10"^°) 

15 ("unknown") and a dotted line denotes sequence-verified 

spots with a BLAST expect ("E") value of less than le-30 (1 
X 10"^°) ("known") ; 

FIG. 9 presents a Mondriar. of BAG AC008172 (bases 
25,000 to 130,000), containing the carbamyl phosphate 

20 synthetase gene (AF154830.1); and 

FIG. 10 is a Mondrian of BAG A049839. 



Methods and Apparatus for Predicting, Confirming, 
25 Annotating, and Displaying Functional Regions From Genomic 
Sequence Data 

FIG. 1 is a flow chart illustrating in broad 
outline a process for predicting functional regions from 
30 genomic sequence, confirming and characterizing the 

functional activity of such regions experimentally, and 
then associating and displaying the information so obtained 
in meaningful and useful relationship to the original 
sequence data. 

35 The initial input into process 10 of the present 
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invention is drawn from one or more databases 100 
containing genomic sequence data. Because genomic sequence 
is usually obtained from subgenomic fragments, the sequence 
data typically will be stored in a series of records 
5 corresponding to these subgenomic sequenced fragments. 
Some fragments will have been catenated to form larger 
contiguous sequences ("contigs"); others will not. A 
finite percentage of sequence data in the database will 
typically be erroneous, consisting inter alia of vector 

10 sequence, sequence created from aberrant cloning events, 
sequence of artificial polylinkers, and sequence that was 
erroneously read. 

Each sequence record in database 100 will 
minimally contain as annotation a unique sequence 

15 identifier (accession number) , and will typically be 
annotated further to identify the date of accession, 
species of origin, and depositor. Because database 100 can 
contain nongenomic sequence, each sequence will typically 
be annotated further to permit query for genomic sequence. 

20 Chromosomal origin, optionally with map location, can also 
be present. Data can be, and over time increasingly will 
be, further annotated with additional information, in part 
through use of the present invention, as described below. 
Annotation can be present within the data records, in 

25 information external to database 100 and linked to the 
records thereto, or through a combination of the two. 

Databases useful as genomic sequence database 100 
in the present invention include GenBank, and particularly 
include several divisions thereof, including the 

30 htgs (draft), NT (nucleotide, command line), and NR 

(nonredundant ) divisions. GenBank is produced by the 
National Institutes of Health and is maintained by the 
National Center for Biotechnology Information (NCBI) . 
Databases of genomic sequence from species other than 

35 human, such as mouse, rat, Arabidopsis, C. elegans, C. 
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brigsii, Drosophila, zebra fish, and other higher 
eukaryotic organisms will also prove useful as genomic 
sequence database 100. 

Genomic sequence obtained by query of genomic 
5 sequence database 100 is then input into one or more 

processes 200 for identification of regions therein that 
are predicted to have a biological function as specified by 
the user. Such functions include, but are not limited to, 
encoding protein, regulating transcription, regulating 

10 message transport after transcription into mRNA, regulating 
message splicing after transcription into mRNA, of 
regulating message degradation after transcription into 
mRNA, and the like. Other functions include directing 
somatic recombination events, contributing to chromosomal 

15 stability or movement, contributing to allelic exclusion or 
X chromosome inactivation, and the like. 

The particular genomic sequence to be input into 
process 200 will depend upon the function for which 
relevant sequence is to be identified as well as upon the 

20 approach chosen for such identification. Process step 200 
can be iterated to identify different functions within a 
given genomic region. In such case, the input often will 
be different for the several iterations. 

Sequences predicted to have the requisite 

25 function by process 200 are then input into process 300, • 
where a subset of the input sequences suitable for 
experimental confirmation is identified. Experimental 
confirmation can involve physical and/or bioinf ormatic 
assay. Where the subsequent experimental assay is 

30 bioinformatic, rather than physical, there are fewer 

constraints on the sequences that can be tested, and in 
this latter case therefore process 300 can output the 
entirety of the input sequence. 

The subset of sequences output from process 300 

35 is then used in process 400 for experimental verification 
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and characterization of the function predicted in 

process 200, which experimental verification can, and often 

will, include both physical and bioinf ormatic assay. 

Process 500 annotates the sequence data with the 
5 functional information obtained in the physical and/or 
bioinf ormatic assays of process 400. 'Such annotation can 
be done using any technique that usefully relates the 
functional information to the sequence, as, for example, by 
incorporating the functional data into the sequence data 

10 record itself, by linking records in a hierarchical or 

relational database, by linking to external databases, by a 
combination thereof, or by other means well known within 
the database arts. The data can even be submitted for 
incorporation into databases maintained by others, such as 

15 GenBank, which is maintained by NCBI. 

As further noted in FIG. 1, additional annotation 
can be input into process 500 from external sources 600. 

The annotated data is then displayed in process 
800, either before, concomitantly with, or after optional 

20 storage 7 00 on nontransient media, such as magnetic disk, 
optical disc, magnetooptical disk, flash memory, or the 
like. 

FIG. 1 shows that the experimental data output 
from process 400 can be used in each preceding step of 

25 process 10: e.g., facilitating identification of functional 
sequences in process 200, facilitating identification of an 
experimentally suitable subset thereof in process 300, and 
facilitating creation of physical and/or informational 
substrates for, and performance of subsequent assay, of 

30 functional sequences in process 400. 

Information from each step can be passed directly 
to the succeeding process, or stored in permanent or 
interim form prior to passage to the succeeding process. 
Often, data will be stored after each, or at least a 

35 plurality, of such process steps. Any or all process steps 
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can be automated. 

FIG. 2 further elaborates the prediction of 
functional sequence within genomic sequence according to 
process 200. 

5 Genomic sequence database 100 is first queried 20 

for genomic sequence. 

The sequence required to be returned by query 20 
will depend, in the first instance, upon the function to be 
identified. 

10 For example, genomic sequences that function to 

encode protein can be identified inter alia using gene 
prediction approaches, comparative sequence analysis 
approaches, or combinations of the two. In gene prediction 
analysis, sequence from one genome is input into process 

15 200 where at least one, preferably a plurality, of 

algorithmic methods are applied to identify putative coding 
regions. In comparative sequence analysis, by contrast, 
corresponding, e.g., syntenic, sequence from a plurality of 
sources, typically a plurality of species, is input into 

20 process 200, where at least one, possibly a plurality, of 
algorithmic methods are applied to compare the sequences 
and identify regions of least variability. 

The exact content of query 20 will also depend 
upon the database queried. For example, if the database 

25 contains both genomic and nongenomic sequence, perhaps 
derived from multiple species, and the function to be 
determined is protein coding regions in human genomic 
sequence, the query will accordingly require that the 
sequence returned be genomic and derived from humans. 

30 Query 20 can also incorporate criteria that 

compel return of sequence that meets operative requirements 
of the subsequent analytical method. Alternatively, or in 
addition, such operative criteria can be enforced in 
subsequent preprocess step 24. 

35 For example, if the function sought to be 
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identified is protein coding, query 20 can incorporate 
criteria that return from genomic sequence database 100 
only those sequences present within contigs sufficiently 
long as to have obviated substantial fragmentation of any 
5 given exon among a plurality of separate sequence 
fragments . 

Such criteria can, for example, consist of a 
required minimal individual genomic sequence fragment 
length, such as 10 kb, more typically 20 kb, 30 kb, 40kb, 

10 and preferably 50 kb or more, as well as an optional 

further or alternative requirement that sequence from any 
given clone, such as a bacterial artificial chromosome 
("BAG"), be presented in no more than a finite maximal 
number of fragments, such as no more than 20 separate 

15 pieces, more typically no more than 15 fragments, even more 
typically no more than about 10 - 12 fragments. 

Results using the present invention have shown 
that genomic sequence from bacterial artificial chromosomes 
(BACs) is sufficient for gene prediction analysis according 

20 to the present invention if the sequence is at least 50 kb 
in length, and if additionally the sequence from any given 
BAG is presented in fewer than 15, and preferably fewer 
than 10, fragments. Accordingly, query 20 can incorporate 
a requirement that data accessioned from BAG sequencing be 

25 in fewer than 15, preferably fewer than 10, fragments. 

An additional criterion that can be incorporated 
into the query can be the date, or range of dates, of 
sequence accession. Although the process has been 
described above' as if genomic sequence database ICQ wejre 

30 static, it is of course understood that the genomic 
sequence databases need not be static, and indeed are 
typically updated on a frequent, even hourly, basis. Thus, 
as further described in Examples 1 and 2, infra, it is 
possible to query the database for newly added sequence, 

35 either newly added after an absolute date, or newly added 
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relative to a prior analysis performed using the methods 
and apparatus of the present invention. In this way, the 
process herein described can incorporate a dynamic, 
temporal component . 
5 One utility of such temporal limitation is to 

identify, from newly accessioned genomic sequence, the 
presence of novel genes, particularly those not previously 
identified by EST sequencing (or other sequencing efforts 
that are similarly based upon gene expression) . As further 

10 described in Example 1, such an approach has shown that 

newly accessioned human genomic sequence, when analyzed for 
sequences that function to encode protein, readily 
identifies genes that are novel over those in existing EST 
and other expression databases. This makes the methods of 

15 the present invention extremely powerful gene discovery 
tools. And as would be appreciated, such gene discovery 
can be performed using genomic sequence from species other 
than human. 

If query 20 incorporates multiple criteria, such 
20 as above-described, the multiple criteria can be performed 
as a series of separate queries or as a single query, 
depending in part upon the query language, the complexity 
of the query, and other considerations well known in the 
database arts . 

25 If query 20 returns no genomic sequence meeting 

the query criteria, the negative result can be reported by 
process 22, and process 200 (and indeed, entire process 10) 
ended 23, as shown. Alternatively, or in addition to 
report and termination of the initial inquiry, a new query 

30 20 can be generated that takes into account the initial 
negative result. 

When query 20 returns sequence meeting the query 
criteria, the returned sequence is then passed to optional 
preprocessing 2,4, suitable and specific for the desired 

35 analytical approach and the particular analytical methods 
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thereof to be used in process 25. 

Preprocessing 24 can include processes suitable 
for many approaches and methods thereof, as well as 
processes specifically suited for the intended subsequent 
5 analysis. 

Preprocessing 24 suitable for most approaches and 
methods will include elimination of sequence irrelevant to, 
or that would interfere with, the subsequent analysis. 
Such sequence includes repetitive sequence, such as Alu 

10 repeats and LINE elements, vector sequence, artificial 
sequence, such as artificial polylinkers , and the like. 
Such removal can readily be performed by identification and 
subsequent masking of the undesired sequence. 

Identification can be effected by comparing 'the 

15 genomic sequence returned by query 20 with public or 
private databases containing known repetitive sequence, 
vector sequence, artificial sequence, and other artifactual 
sequence. Such comparison can readily be done using 
programs well known in the art, such as CROSS_MATCH, or by 

20 proprietary sequence comparison programs the engineering of 
which is well within the skill in the art. 

Alternatively, or in addition, undesirable, 
including artifactual, sequence can be identified 
algorithmically without comparison to external databases 

25 and thereafter removed. For example, synthetic polylinker 
sequence can be identified by an algorithm that identifies 
a significantly higher than average density of known 
restriction sites. As another example, vector sequence can 
be identified by algorithms that identify nucleotide or 

30 codon usage at variance with that of the bulk of the 
genomic sequence. 

Once identified, undesired sequence can be 
removed. Removal can usefully be done by masking the 
undesired sequence as, for example, by converting the 

35 specific nucleotide references to one that is unrecognized 
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by the subsequent bioinformatic algorithms, such as "X". 
Alternatively, but at present less preferred, the undesired 
sequence can be excised from the returned genomic sequence, 
leaving gaps . 

5 Preprocessing 24 can further include selection 

from among duplicative sequences of that one sequence of 
highest quality. Higher quality can be measured as a lower 
percentage of, fewest number of, or least densely clustered 
occurrence of ambiguous nucleotides, defined as those 
10 nucleotides that are identified in the genomic sequence 
using symbols indicating ambiguity. Higher quality can 
also or alternatively be valued by presence in the longest 
contig. 

Preprocessing 24 can, and often will, also 

15 include formatting of the data as specifically appropriate 
for passage to the analytical algorithms of process 25. 
Such formatting can and typically will include, inter alia, 
addition of a unique sequence identifier, either derived 
from the original accession number in genomic sequence 

20 database 100, or newly applied, and can further include 
additional annotation. Formatting can include conversion 
from one to another sequence listing standard, such as 
conversion to or from FASTA or the like, depending upon the 
input expected by the subsequent process. 

25 Preprocessing, which can be optional depending 

upon the function desired to be identified and the 
informational requirements of the methods for effecting 
such identification, is followed by sequence processing 25, 
where sequences with the desired function are identified 

30 within the genomic sequence. 

As mentioned above, such functions can include, 
but are not limited to, encoding protein, regulating 
transcription, regulating message transport after 
transcription into mRNA, regulating message splicing after 

35 transcription, of regulating message degradation, and the 
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like. Other functions include directing somatic 
recombination events, contributing to chromosomal stability 
or movement, contributing to allelic exclusion or X 
chromosome inactivation, or the like. 
5 The methods of the present invention are 

particularly useful for gene discovery, that is, for 
identifying, from genomic sequence, regions that function 
to encode genes, and in a particularly useful embodiment, 
for identifying regions that function to encode genes not 

10 hitherto identified by expression-based or directed cloning 
and sequencing. In conjunction with verification using the 
novel single exon microarrays of the present invention, as 
further described below, the methods herein described 
become powerful gene discovery tools. 

15 Accordingly, in a preferred embodiment of the 

present invention, process 25 is used to identify putative 
coding regions. Two preferred approaches in process 25 for 
identifying sequence that encodes putative genes are gene 
prediction and comparative sequence analysis. 

20 Gene prediction can be performed using any of a 

number of algorithmic methods, embodied in one or more 
software programs, that identify open reading frames (ORFs) 
using a variety of heuristics, such as GRAIL, DICTION, and 
GENEFINDER. Comparative sequence analysis similarly can be 

25 performed using any of a variety of known programs that 
identify regions with lower sequence variability. 

As further described in Example 1, below, gene 
finding software programs yield a range of results. For 
the newly accessioned human genomic sequence input in 

30 Example 1, for example, GRAIL identified the greatest 

percentage of genomic sequence as putative coding region, 
2% of the data analyzed; GENEFINDER was second, calling 1%; 
and DICTION yielded the least putative coding region, with 
0.8% of genomic sequence called as coding region. 

35 Increased reliability can be obtained when 
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consensus is required among several such methods. Although 
discussed herein particularly with respect to exon calling, 
consensus among methods will in general increase 
reliability of predicting other functions as well. 
5 Thus, as indicated by query 26, sequence 

processing 25, optionally with preprocessing 24, can be 
repeated with a different method, with consensus among such 
iterations determined and reported in process 27. 

Process 27 compares the several outputs for a 

10 given input genomic sequence and identifies consensus among 
the separately reported results. The consensus itself, as 
well as the sequence meeting that consensus, is then stored 
in process 29a, displayed in process 29b, and/or output to 
process 300 for subsequent identification of a subset 

15 thereof suitable for assay. 

Multiple levels of consensus can be calculated 
and reported by process 27. For example, as further 
described in Example 1, infra, process 27 can report 
consensus as between all specific pairs of methods of gene 

20 prediction, as consensus among any one or more of the pairs 
of methods of gene prediction, or as among all of the gene 
prediction algorithms used. Thus, in Example 1, process 27 
reported that GRAIL and GENEFINDER programs agreed on 0.7% 
of genomic sequence, that GRAIL and DICTION agreed on 0.5% 

25 of genomic sequence, and that the three programs together 
agreed on 0.25% of the data analyzed. Put another way, 
p. 25% of the genomic sequence was identified by all three 
of the programs as containing putative coding region. 

Furthermore, consensus can be required among 

30 different approaches to identifying a chosen function. 

For example, if the function desired to be 
identified is coding of protein sequence, and a first used 
approach to exon calling is gene prediction, the process 
can be repeated on the same input sequence, or subset 

35 thereof, with another approach, such as comparative 
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sequence analysis. In such a case, where comparative 
sequence analysis follows gene prediction, the comparison 
can be performed not only on genomic nucleic acid sequence, 
but additionally or alternatively can be performed on the 
5 predicted amino acid sequence translated from the ORFs 
prior identified by the gene prediction approach. 

Although shown as an iterative process, the 
multiple analyses required to achieve consensus can be done 
in series, in parallel, or some combination thereof. 
10 Predicted functional sequence, optionally 

representing a consensus among a plurality of methods and 
approaches for determination thereof, is passed to process 
300 for identification of a subset thereof for functional 
assay. 

15 In the preferred embodiment of the methods of the 

present invention, wherein the function sought to be 
identified is protein coding, process 300 is used to 
identify a subset thereof suitable for experimental 
verification by physical and/or bioinf ormatic approaches. 

20 For example, putative ORFs identified in process . 

200 can be classified, or binned, bioinf ormatically into 
putative genes. This binning can be based inter alia upon 
consideration of the average number of exons/gene in the 
species chosen for analysis, upon density of exons that 

25 have been called on the genomic sequence, and other 

empirical rules. Thereafter, one or more among the gene- 
specific ORFs can be chosen for subsequent use in gene 
expression assay. 

Where such subsequent gene expression assay uses 

30 amplified nucleic acid, considerations such as desired 
amplicon length, primer synthesis requirements, putative 
exon length, sequence GC content, existence of possible 
secondary structure, and the like can be used to identify 
and select those ORFs that appear most likely successfully 

35 to amplify. Where subsequent gene expression assay relies 
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upon nucleic acid hybridization, whether or not using 
amplified product, further considerations involving 
hybridization stringency can be applied to identify that 
subset of sequences that will most readily permit sequence- 
5 specific discrimination at a chosen hybridization and wash 
stringency. One particular such consideration is avoidance 
of putative exons that span repetitive sequence; such 
sequence can hybridize spuriously to nonspecific message, 
reducing specific signal in the hybridization. 

10 For bioinf ormatic assay, there are fewer 

constraints on the sequences that can be tested 
experimentally, and in this latter case therefore process 
300 can output the entirety of the input sequence. 

The subset of sequences identified by process 300 

15 as suitable for use in assay is then used in process 400 to 
create the physical and/or informational substrate for 
experimental verification of the predictions made in 
process 200, and thereafter to assay those substrates. 
As mentioned, the methods of the present 

20 invention are particularly useful for identifying potential 
coding regions within genomic sequence. In a preferred 
embodiment of process 400, therefore, the expression of the 
sequences predicted to encode protein is verified. The 
combination of the predictive and experimental methods 

25 provides a powerful gene discovery engine. 

Thus, in another aspect, the present invention 
provides methods and apparatus for verifying the expression 
of putative genes identified within genomic sequence. In 
particular, the invention provides a novel method of 

30 verifying gene expression in which expression of predicted 
ORFs is measured and confirmed using a novel type of 
nucleic acid microarray, the genome-derived single exon 
nucleic acid microarrays of the present invention. 

Putative ORFs as predicted by a consensus of gene 

35 calling, particularly gene prediction, algorithms in 
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process 200, and as further identified as suitable by- 
process 300, are amplified from genomic DNA using the 
polymerase chain reaction (PGR) . Although PGR is 
conveniently used, other amplification approaches can also 
5 be used. 

Amplification schemes can be designed to capture 
the entirety of each predicted ORF in an amplicon with 
minimal additional (that is, intronic or intergenic) 
sequence. Because ORFs predicted from human genomic 
10 sequence using the methods of the present invention differ 
in length, such an approach results in amplicons of varying 
length. 

However, most predicted ORFs are shorter than 500 
bp in length, and although amplicons of at least about 100 

15 or 200 base pairs can be immobilized as probes on nucleic 
acid microarrays, early experimental results using the 
methods of the present invention have suggested that longer 
amplicons, at least about 400 or 500 base pairs, are more 
■ effective. Furthermore, certain advantages derive from 

20 application to the microarray of amplicons of defined size. 
Therefore, amplification schemes can 
alternatively, and preferably, be designed to amplify 
regions of defined size, preferably at least about 300, 400 
or 500 bp, centered about each predicted ORF. Such an 

25 approach results in a population of amplicons of limited 
size diversity, but that typically contain intronic and/or 
intergenic nucleic acid in addition to putative ORF. 

Conversely, somewhat fewer than 10% of ORFs 
predicted from human genomic sequence according to the 

30 methods of the present invention exceed 500 bp in length. 
Portions of such extended ORFs, preferably at least about 
300,400 or 500 bp in length, can be amplified. However, it 
has been discovered that the percentage success at 
amplifying pieces of such ORFs is low, and that such 

35 putative exons are more effectively amplified when larger 
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fragments, at least about 1000 or 1500 bp, and even as 
large as 2000 bp are amplified. 

The putative ORFs selected in process 300 are 
thus input into one or more primer design programs, such as 
5 PRIMER3 (available online for use at 

http://www-genorae.wi.mit.edu/cgi-bin/primer/ ), with a goal 
of amplifying at least about 500 base pairs of genomic 
sequence centered within or about ORFs predicted to be no 
more than about 500 bp, or at least about 1000 - 1500 bp of 

10 genomic sequence for ORFs predicted to exceed 500 bp in 

length, and the primers synthesized by standard techniques. 
Primers with the requisite sequences can be purchased 
commercially or synthesized by standard techniques. 

Conveniently, a first predetermined sequence can 

15 be added commonly to the ORF-specific 5' primer and a 
second, typically different, predetermined sequence 
commonly added to each 3' ORF-unique primer. This serves 
to immortalize the amplicon, that is, serves to permit 
further amplification of any amplicon using a single set of 

20 primers complementary respectively to the common 5' and 
common 3' sequence elements. The presence of these 
"universal" priming sequences further facilitates later 
sequence verification, providing a sequence common to all 
amplicons at which to prime sequencing reactions. The 

25 common 5' and 3' sequences further serve to add a cloning 
site should any of the ORFs warrant further study. 

Such predetermined sequence is usefully at least 
about 10, 12 or 15 nt in length, and usually does not 
exceed about 25 nt in length. The "universal" priming 

30 sequences used in the examples presented infra were each 16 
nt long. 

The genomic DNA to be used as substrate for 
amplification will come from the eukaryotic species from 
which the genomic sequence data had originally been 
35 obtained, or a closely related species, and can 
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conveniently be prepared by well known techniques from 
somatic or germline tissue or cultured cells of the 
organism. See, e.g.. Short Protocols in Molecular Biology 
: A Compendium of Methods from Current Protocols in 
5 Molecular Biology , Ausubel fet al. (eds.), 4*^ edition 
(April 1999), John Wiley & Sons (ISBN: 047132938X) and 
Maniatis et al., Molecular Cloning : A Laboratory Manual , 
2"^ edition (December 1989), Cold Spring Harbor Laboratory 
Press (ISBN: 0879693096). Many such prepared genomic DNAs 
10 are available commercially, with the human genomic DNAs 
additionally having certification of donor informed 
consent . 

Although the intronic and intergenic material 
flanking putative coding regions in the amplicons could 

15 potentially interfere with hybridizations during microarray 
experiments, we have found, surprisingly, that differential 
expression ratios are not significantly affected. Rather, 
the predominant effect of exon size is to alter the 
absolute signal intensity, rather than its ratio. Equally 

20 surprising, the art had suggested that single exon probes 
would not provide sufficient signal intensity for high 
stringency hybridization analyses; we find that such probes 
not only provide adequate signal, but have substantial 
advantages, as herein described. 

25 After partial purification, as by size exclusion 

spin column, with or without confirmation as to amplicon 
quality as by gel electrophoresis, each amplicon (single 
exon probe) is disposed in an array upon a support 
substrate . 

30 Methods for creating microarrays by deposition 

and fixation of nucleic acids onto support substrates are 
well known in the art (Reviewed by Schena et al., see 
above) . 

Typically, the support substrate will be glass, 
35 although other materials, such as amorphous or crystalline 
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silicon or plastics. Such plastics include 
polymethylacrylic, polyethylene, polypropylene, 
polyacrylate, polymethylmethacrylate, polyvinylchloride, 
polytetraf luoroethylene, polystyrene, polycarbonate, 
5 polyacetal, polysulfone, celluloseacetate, 

cellulosenitrate, nitrocellulose, or mixtures thereof, can 
also be used. Typically, the support will be rectangular, 
although other shapes, particularly circular disks and even 
spheres, present certain advantages. Particularly 

10 advantageous alternatives to glass slides as support 

substrates for array of nucleic acids are optical discs, as 
described in WO 98/12559. 

The amplified nucleic acids can be attached 
covalently to a surface of the support substrate or, more 

15 typically, applied to a derivatized surface in a chaotropic 
agent that facilitates denaturation and adherence by 
presumed noncovalent interactions, or some combination 
thereof. 

Robotic spotting devices useful for arraying 

20 nucleic acids on support substrates can be constructed 
using public domain specifications (The MGuide, version 
2.0, http://cmgm.stanford.edu/pbrown/mguide/index.html), or 
can conveniently be purchased from commercial sources 
(MicroArray Genii Spotter and MicroArray Genlll Spotter, 

25 Molecular Dynamics, Inc., Sunnyvale, CA) . Spotting can 

also be effected by printing methods, including those using 
ink jet technology. 

As is well known in the art, microarrays 
typically also contain immobilized control nucleic acids. 

30 For controls useful in providing measurements of background 
signal for the genome-derived single exon microarrays of 
the present invention, a plurality of E. coli genes can 
readily be used. As further described in Example 1, 16 or 
32 E. coli genes suffice to provide a robust measure of 

35 background noise in such microarrays. 
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As is well known in the art, the amplified 
product disposed in arrays on a support substrate to create 
a nucleic acid microarray can consist entirely of natural 
nucleotides linked by phosphodiester bonds, or 
5 alternatively can include either nonnative nucleotides, 
alternative internucleotide linkages, or both, so long as 
complementary binding can be obtained in the hybridization. 
If enzymatic amplification is used to produce the 
immobilized probes, the amplifying enzyme will impose 

10 certain further constraints upon the types of nucleic acid 
analogs that can be generated. 

Although particularly described herein as using 
high density microarrays constructed on planar substrates, 
the methods of the present invention for confirming the 

rs expression of ORFs predicted from genomic sequence can use 
any of the known types of microarrays, as herein defined, 
including lower density planar arrays, and microarrays on 
nonplanar, nonunitary, distributed substrates. 

For example, gene expression can be confirmed 

20 using hybridization to lower density arrays, such as those 
constructed on membranes, such as nitrocellulose, nylon, 
and positively-charged derivatized nylon membranes. 
Further, gene expression can also be confirmed using 
nonplanar, bead-based microarrays such as are described in 

25 Brenner et al., Proc. Natl. Acad. Sci. USA 97 ( 4 ): 166501670 
(2000); U.S. Patent No. 6,057,107; and U.S. Patent No. 
5,736,330. In theory, a packed collection of such beads 
provides in aggregate a higher density of nucleic acid 
probe than can be achieved with spotting or lithography 

30 techniques on a single planar substrate. 

Planar microarrays on solid substrates, however, 
provide certain useful advantages, including high 
throughput and compatibility with existing readers. For 
example, each standard microscope slide can include at 

35 least 1000, typically at least 2000, preferably 5000 and 
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upto 10,000 - 50,000 or more nucleic acid probes of 
discrete sequence. The number of sequences deposited will 
depend on their required application. 

Each putative gene can be represented in the 
5 array by a single predicted ORF. Alternatively, genes can 
be represented by more than one predicted ORF. For 
purposes of measuring differential splicing, more than one 
predicted ORF will be provided for a putative gene. And as 
is well known in the art, each probe of defined sequence, 

10 representing a single predicted ORF, can be deposited in a 
plurality of locations on a single microarray to provide 
redundancy of signal. 

The genome-derived single exon microarrays 
described above differ in several fundamental and 

15 advantageous ways from microarrays presently used in the 
gene expression art, including (1) those created by 
deposition of raRNA-derived nucleic acids, (2) those created 
by in situ synthesis of oligonucleotide probes, and (3) 
those constructed from yeast genomic DNA. 

20 Most nucleic acid microarrays that are in use for 

study of eukaryotic gene expression have as immobilized 
probes nucleic acids that are derived — either directly or 
indirectly — from expressed message. As discussed above, 
it is common, for example, for such microarrays to be 

25 derived from cDNA/EST libraries, either from those 

previously described in the literature, see Lennon at al., 
or from the de novo construction of "problem specific" 
libraries targeted at a particular biological question, 
R.S. Thomas et al., Cancer Res. (in press). Such 

30 microarrays are herein collectively denominated "EST 
microarrays" . 

Such EST microarrays by definition can measure 
expression only of those genes found in EST libraries, 
shown herein to represent only a fraction of expressed 

35 genes. Furthermore, such libraries — and thus microarrays 
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based thereupon - are biased by the tissue or cell type of 
message origin, by the expression levels of the respective 
genes within the tissues, and by the ability of the message 
successfully to have been reverse-transcribed and cloned. 
5 Thus, as further discussed in Example 1, the 

methods of the present invention enable sequences that do 
not appear in EST or other expression databases to be 
determined - subsequently arrayed for expression 
measurements could not, therefore, have been represented as 

10 probes on an EST microarray. And as further demonstrated 
in the examples, infra, the remaining population of genes 
identified from genomic sequence by the methods of the 
present invention - that is, the one third of sequences 
that had previously been accessioned in EST or other 

15 expression databases — are biased toward genes with higher 
expression levels. 

Representation of a message in an EST and/or cDNA 
library depends upon the successful reverse transcription, 
optionally but typically with subsequent successful 

20 cloning, of the message. This introduces substantial bias 
into the population of probes available for arraying in EST 
microarrays . 

In contrast, neither reverse transcription nor 
cloning is required to produce the probes arrayed on the 

25 genome-derived single exon microarrays of the present 

invention. And although the ultimate deposition of a probe 
on the genome-derived single exon microarray of the present 
invention depends upon a successful amplification from 
genomic material, a priori knowledge of the sequence of the 

30 desired amplicon affords greater opportunity to recover any 
given probe sequence recalcitrant to amplification than is 
afforded by the requirement for successful reverse 
transcription and cloning of unknown message in EST 
approaches . 

35 Thus, the genome-derived single exon microarrays 
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of the present invention present a far greater diversity of 
probes for measuring gene expression, with far less bias, 
than do EST microarrays presently used in the art. 

As a further consequence of their ultimate origin 
5 from expressed message, the probes in EST microarrays often 
contain poly-A (or complementary poly-T) stretches derived 
from the poly-A tail of mature mRNA. These homopolymeric 
stretches contribute to cross-hybridization, that is, to a 
spurious signal occasioned by hybridization to the 

10 homopolymeric tail of a labeled cDNA that lacks sequence 
homology to the. gene-specific portion of the probe. 

In contrast, the probes arrayed in the genome- 
derived single exon microarrays of the present invention 
lack homopolymeric stretches derived from message 

15 polyadenylation, and thus can provide more specific signal. 
Typically, at least about 50, 60 or 75% of the probes on 
the genome-derived single exon microarrays of the present 
invention lack homopolymeric regions consisting of A or T, 
where a homopolymeric region is defined for purposes herein 

20 as stretches of 25 or more, typically 30 or more, identical 
nucleotides . 

A further distinction, which also affects the 
specificity of hybridization, is occasioned by the typical 
derivation of EST microarray probes from cloned material. 

25 Because much of the probe material disposed as probes on 
EST microarrays is excised or amplified from plasmid, 
phage, or phagemid vectors, EST microarrays typically 
include a fair amount of vector sequence, more so when the 
probes are amplified, rather than excised, from the vector. 

30 In contrast, the vast majority of probes in the 

genome-derived single exon microarrays of the present 
invention contain no prokaryotic or bacteriophage vector 
sequence, having been amplified directly or indirectly from 
genomic DNA. Typically, therefore, at least about 50, 60, 

35 70 or 80% or more of individual exon-including probes 
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disposed on a genome-derived single exon microarray of the 
present invention lack vector sequence, and particularly 
lack sequences drawn from plasmids and bacteriophage. 
Preferably, at least about 85, 90 or more than 90% of exon— 
5 including probes in the genome-derived single exon 

microarray of the present invention lack vector sequence. 
With attention to removal of vector sequences through 
preprocessing 24, percentages of vector-free exon-including 
probes can be as high as 95 - 99%. The substantial absence 

10 of vector sequence from the genome-derived single exon 
microarrays of the present invention results in greater 
specificity during hybridization, since spurious cross- 
hybridization to a probe vector sequence is reduced. 

As a further consequence of excision or 

15 amplification of probes from vectors in construction of EST 
microarrays, the probes arrayed thereon often contain 
artificial sequence, derived from vector polylinker 
multiple cloning sites, at both 5' and 3' ends. The probes 
disposed upon the genome-derived single exon microarrays 

20 need have no such artificial sequence appended thereto. 

As mentioned above, however, the ORF-specific 
primers used to amplify putative ORFs can include 
artificial sequences, typically 5' to the ORF-specific 
primer sequence, useful for "universal" (that is, 

25 independent of ORF sequence) priming of subsequent 
amplification or sequencing reactions. When such 
"universal" 5' and/or 3' priming sequences are appended to 
the amplification primers, the probes disposed upon the 
genome-derived single exon microarray will include 

30 artificial sequence similar to that found in EST 

microarrays. However, the genome-derived single exon 
microarray of the present invention can be made without 
such sequences, and if so constructed, presents an even 
smaller amount of nonspecific sequence that would 

35 contribute to nonspecific hybridization. 
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Yet another consequence of typical use of cloned 
material as probes in EST microarrays is that such 
microarrays contain probes that result from cloning 
artifacts, such as chimeric molecules containing coding 
5 region of two separate genes . Derived from genomic 

material, typically not thereafter cloned, the probes of 
the genome-derived single exon microarrays of the present 
invention lack such cloning artifacts, and thus provide 
greater specificity of signal in gene expression 

10 measurements. 

A further consequence of the cloned origin of 
probes on many EST microarrays is that the individual 
probes often have disparate sizes, which can cause the 
optimal hybridization stringency to vary among probes on a 

15 single microarray. In contrast, as discussed above, the 
probes arrayed on the genome-derived single exon 
microarrays of the present invention can readily be 
designed to have a narrow distribution in sizes, with the 
range of probe sizes no greater than about 10% of the 

20 average size, typically no greater than about 5% of the 
average probe size. 

Because of their origin from fully- or partially- 
spliced message, probes disposed upon EST arrays will often 
include multiple exons. The percentage of such exon- 

25 spanning probes in an EST microarray can be calculated, on 
average, based upon the predicted number of exons/gene for 
the given species and the average length of the immobilized 
probes. For human genes, the near-complete sequence of 
human chromosome 22, Dunham efc ai.. Nature 4 02 ( 67 61 ): 4 8 9- 95 

30 (1999), predicts that human genes average 5.5 exons/gene. 
Even with probes of 200 - 500 bp, the vast majority of 
•human EST microarray probes include more than one exon. 

In contrast, by virtue of their origin from 
algorithmically identified ORFs in genomic sequence, the 

35 probes in the genome-derived single exon microarrays of the 
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present invention can consist of individual axons. Thus, 
in contrast to EST microarrays , at least about 50, 60, 70, 
75, 80, 85, 95 or 99% of probes deposited in the genome- 
derived microarray of the present invention consist of, or 
5 include, no more than one predicted ORF. 

This provides the ability, not readily achieved 
using EST microarrays, to use the genome-derived single 
exon microarrays of the present invention to measure 
tissue-specific expression of individual exons, which in 
10 turn allows differential splicing events to be detected and 
characterized, and in particular, allows the correlation of 
differential splicing to tissue-specific expression 
patterns . 

Furthermore, the exons that are represented in 

15 EST microarrays are often biased toward the 3' or 5 ' end of 
their respective genes, since sequencing strategies used 
for EST identification are so biased. In contrast, no such 
3' or 5' bias necessarily inheres in the selection of exons 
for disposition on the genome-derived single exon 

20 microarrays of the present invention. 

Conversely, the probes provided on the genome- 
derived single exon microarrays of the present invention 
typically, but need not necessarily, include intronic 
and/or intergenic sequence that is absent from EST 

25 microarrays, which are derived from mature mRNA. 

Typically, at least about 50, 60, 70, 80 or 90% of the 
exon-including probes on the genome-derived single exon 
microarrays of the present invention include sequence drawn 
from noncoding regions. As discussed above, the additional 

30 presence of noncoding region does not significantly 

interfere with measurement of gene expression, and provides 
the additional opportunity to assay prespliced RNA, and 
thus measure such phenomena such as nuclear export control. 

The genome-derived single exon microarrays of the 

35 present invention are also quite different from in situ 
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synthesis microarrays, where probe size is severely 
constrained by inadequacies in the photolithographic 
synthesis process. 

Typically, probes arrayed on in situ synthesis 
5 microarrays are limited to a maximum of about 25 bp. As a 
well known consequence, hybridization to such chips must be 
performed at low stringency. In order, therefore, to 
achieve unambiguous sequence-specific hybridization 
results, the in situ synthesis raicroarray requires 

10 substantial redundancy, with concomitant programmed 

arraying for each probe of probe analogues with altered 
{i.e., mismatched) sequence. 

In contrast, the longer probe length of the 
genome-derived single exon microarrays of the present 

15 invention allows much higher stringency hybridization and 
wash. Typically, therefore, exon-including probes on the 
genome-derived single exon microarrays of the present 
invention average at least about 100, 200, 300, 400 or 
500 bp in length. By obviating the need for substantial 

20 probe redundancy, this approach permits a higher density of 
probes for discrete exons or genes to be arrayed on the 
microarrays of the present invention than can be achieved 
for in situ synthesis microarrays. 

A further distinction is that the probes in in 

25 situ .synthesis microarrays typically are covalently linked 
to the substrate surface. In contrast, the probes disposed 
on the genome-derived microarray of the present invention 
typically are, but need not necessarily be, bound 
noncovalently to the substrate. 

30 Furthermore, the short probe size on in situ 

microarrays causes large percentage differences in the 
melting temperature of probes hybridized to their 
complementary target sequence, and thus causes large 
percentage differences in the theoretically optimum 

35 stringency across the array as a whole. 
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In contrast, the larger probe size in the 
microarrays of the present invention create lower 
percentage differences in melting temperature across the 
range of arrayed probes . 
5 A further significant advantage of the 

microarrays of the present invention over in situ 
synthesized arrays is that the quality of each individual 
probe can be confirmed before deposition. In contrast, the 
quality of probes cannot be assessed on a probe-by-probe 
10 basis for the in situ synthesized microarrays presently 
being used. 

The genome-derived single exon microarrays of the 
present invention are also distinguished over, and present 
substantial benefits over, the genome-derived microarrays 

15 from lower eukaryotes such as yeast. Lashkari et al., 
Proc. Natl. Acad. Sci . USA 94:13057-13062 (1997). 

Only about 220 - 250 of the 6100 or so nuclear 
genes in Saccharomyces cerevisiae — that is, only about 4 
- 5% — have standard, spliceosomal, introns, Lopez et al., 

20 Nucl. Acids Res. 28:85-86 (2000); Spingola et al., RNA 
5(2):221-34 (1999). Furthermore, the entire yeast genome 
has already been sequenced. These two facts permit the 
ready amplification and disposition of single-ORF amplicons 
on such microarray without the requirement for antecedent 

25 use of gene prediction and/or comparative sequence 
analyses . 

Thus, a significant aspect of the present 
invention is the ability to identify and to confirm 
expression of predicted coding regions in genomic sequence 

30 drawn from eukaryotic organisms that have a higher 

percentage of genes having introns than do yeast such as 
Saccharomyces cerevisiae, particularly in genomic sequence 
drawn from eukaryotes in which at least about 10, 20 or 50% 
of protein-encoding genes have introns. In preferred 

35 embodiments, the methods and apparatus of the present 
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invention are used to identify and confirm expression of 
novel genes from genomic sequence of eukaryotes in which 
the average number of introns per gene is at least about 
one, two or three or more. 
5 After the physical substrate is prepared, 

experimental verification of predicted function is 
performed. 

In a preferred embodiment of the present 
invention, where the function sought to be identified in 

10 genomic sequence is protein coding, experimental 

verification is performed by measuring expression of the 
putative ORFs, typically through nucleic acid hybridization 
experiments, and in particularly preferred embodiments, 
through hybridization to genome-derived single exon 

15 microarrays prepared as above- described. 

Expression is conveniently measured and expressed 
for each probe in the microarray as a ratio of the 
expression measured concurrently in a plurality of mRNA 
sources, according to techniques well known in the 

20 microarray art, Reviewed in Schena et al., and as further 
described in Example 2, below. The mRNA source for the 
reference against which specific expression is measured can 
be drawn from a homogeneous mRNA source, such as a single 
cultured cell-type, or alternatively can be 'heterogeneous , 

25 as from a pool of mRNA derived from multiple tissues and/or 
cell types, as further described in Example 2, infra. 

mRNA can be prepared by standard techniques, see 
Ausubel et al. and Maniatis et al., or purchased 
commercially. The mRNA is then typically reverse- 

30 transcribed in the presence of labeled nucleotides: the 
index source (that in which expression is desired to be 
measured) is reverse transcribed in the presence of 
nucleotides labeled with a first label, typically a 
fluorophore (f luorochrome; fluor; fluorescent dye) ; the 

35 reference source is reverse transcribed in the presence of 
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a second label, typically a fluorophore, typically 
f luorometrically-distinguishable from the first label. As 
further described in Example 2, infra, Cy3 and Cy5 dyes 
prove particularly useful in these methods. After partial 
5 purification of the index and reference targets, 

hybridization to the probe array is conducted according to 
standard techniques, typically under a coverslip. 

After wash, microarrays are conveniently scanned 
using a coiranercial microarray scanning device, such as a 

10 Gen3 Scanner (Molecular Dynamics, Sunnyvale, CA) . Data on 
expression is then passed, with or without interim storage, 
to process 500, where the results for each probe are 
related to the original sequence. 

Often, hybridization of target material to the 

15 genome-derived single exon microarray will identify certain 
of the probes thereon as of particular interest. Thus, it 
is often desirable that the user be able readily to obtain 
sufficient quantities of an individual probe, either for 
subsequent arrayed deposition upon an additional support 

20 substrate, often as part of a microarray having a plurality 
of probes so identified, or alternatively or additionally 
as a solitary solid-phase or solution-phase probe, for 
further use. 

Thus, in another aspect, the present invention 

25 provides compositions and kits for the ready production of 
nucleic acids identical in sequence to, or substantially 
identical in sequence to, probes on the genome-derived 
single exon microarrays of the present invention. 

In this aspect, a small quantity of each probe is 

30 disposed, typically without attachment to substrate, in a 
spatially-addressable ordered set, typically one per well 
of a microtiter dish. Although a 96 well microtiter plate 
can be used, greater efficiency is obtained using higher 
density arrays, such as are provided by microtiter plates 

35 having 384, 864, 1536, 3456, 6144, or 9600 wells, and 
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although microtiter plates having physical depressions 
(wells) are conveniently used, any device that permits 
addressable withdrawal of reagent from fluidly- 
noncommunicating areas can be used. 
5 In this aspect of the invention, therefore, a 

fluidly noncommunicating addressable ordered set of 
individual probes, corresponding to those on a genome- 
derived single exon microarray, is provided, with each 
probe in sufficient quantity to permit airjplif ication, such 

10 as by PGR. As earlier mentioned, the ORF-specific 

5 ' primers used for genomic amplification can have a first 
common sequence added thereto, and the ORF-specific 3' 
■ primers used for genomic amplification can have a second, 
different, common sequence added thereto, thus permitting, 

15 in this preferred embodiment, the use of a single set of 5' 
and 3' primers to amplify any one of the probes from the 
amplifiable ordered set. 

Each discrete amplifiable probe can also be 
packaged with amplification primers, solutes, buffers, 

20 etc., and can be provided in dry (e.g., lyophilized) form 
or wet, in the latter case typically with addition of 
agents that retard evaporation. 

In another aspect of the present invention, a 
genome-derived single-exon microarray is packaged together 

25 with such an ordered set of amplifiable probes 

corresponding to the probes, or one or more subsets of 
probes, thereon. In alternative embodiments, the ordered 
set of amplifiable probes is packaged separately from the 
genome-derived single exon microarray. 

30 In some embodiments, the microarray and/or 

ordered probe set are further packaged with recordable 
media that provide probe identification and addressing 
information, and that can additionally contain annotation 
information, such as gene expression data. Such recordable 

35 media can be packaged with the microarray, with the ordered 
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probe set, or with both. 

If the microarray is constructed on a substrate 
that incorporates recordable media, such as is described in 
international patent application no. WO 98/12559, then 
5 separate packaging of the genome-derived single exon 
microarray and the bioinf ormatic information is not 
required. 

The amount of amplifiable probe material should 
be sufficient to permit at least one amplification 
10 sufficient for subsequent hybridization assay. 

Although the use of high density genome-derived 

microarrays on solid planar substrates is presently a 
preferred approach for the physical confirmation and 
characterization of the expression of sequences predicted 

15 to encode protein, other types of microarrays (as herein 
defined) can also be used. 

Furthermore, as earlier mentioned, experimental 
verification of the function predicted from genomic 
sequence in process 200 can be bioinf ormatic, rather than, 

20 or additional to, physical verification. 

For example, where the function desired to be 
identified is protein coding, the predicted ORFs can be 
compared bioinf ormatically to sequences known or suspected 
of being expressed. 

25 Thus, the sequences output from process 300 (or 

process 200), can be used to query expression databases, 
such as EST databases, SNP ("single nucleotide 
polymorphism") databases, known cDNA and mRNA sequences, 
SAGE ("serial analysis of gene expression") databases, and 

30 more generalized sequence databases that allow query for 
expressed sequences. Such query can be done by any 
sequence query algorithm, such as BLAST ("basic local 
alignment search tool") . The results of such query — 
including information on identical sequences and 

35 information on nonidentical sequences that have diffuse or 
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focal regions of sequence homology to the query sequence — 
can then be passed directly to process 500, or used to 
inform analyses subsequently undertaken in process 200, 
process 300, or process 400. 
5 Experimental data, whether obtained by physical 

or bioinformatic assay in process 400, is passed to process 
500 where it is usefully related to the sequence data 
itself, a process colloquially termed "annotation". Such 
annotation can be done using any technique that usefully 

10 relates the functional information to the sequence, as, for 
example, by incorporating the functional data into the 
record itself, by linking records in a hierarchical or 
relational database, by linking to external databases, or 
by a combination thereof. Such database techniques are 

15 well within the skill in the art. 

The annotated sequence data can be stored 
locally, uploaded to genomic sequence database 100, and/or 
displayed 800. 

The methods and apparatus of the present 

20 invention rapidly produce functional information from 
genomic sequence. Coupled with the escalating pace at 
which sequence now accumulates, the rapid pace of sequence 
annotation produces a need for methods of displaying the 
information in meaningful ways . 

25 FIG. 3 shows visual display 80 presenting a ' 

single genomic sequence annotated according to the present 
invention. Because of its nominal resemblance to artistic 
works of Piet Mondrian, visual display 80 is alternatively 
described herein as a "Mondrian". 

30 Each of the visual elements of display 80 is 

aligned with respect to the genomic sequence being 
annotated (hereinafter, the "annotated sequence") . Given 
the number of nucleotides typically represented in an 
annotated sequence, representation of individual 

35 nucleotides would rarely be readable in hard copy output of 



47 



1 57274 cpc; Fa 

WO 01/57274 PCT/USOl/00666 

display 80. Typically, therefore, the annotated sequence 
is schematized as rectangle 89, extending from the left 
border of display 80 to its right border. By convention 
herein, the left border of rectangle 89 represents the 
5 first nucleotide of the sequence and the right border of 
rectangle 89 represents the last nucleotide of the 
sequence. 

As further discussed below, however, the Mondrian 
visual display of annotated sequence can serve as a 

10 convenient graphical user interface for computerized 

representation, analysis, and query of information stored 
electronically. For such use, the individual nucleotides 
can conveniently be linked to the X axis coordinate of 
rectangle 89. This permits the annotated sequence at any 

15 point within rectangle 89 readily to be viewed, either 

automatically — for example, by time-delayed appearance of 
a small overlaid window upon movement of a cursor or other 
pointer over rectangle 89 — or through user intervention, 
as by clicking a mouse or other pointing device at a point 

20 in rectangle 89. 

Visual display 80 is generated after user 
specification of the genomic sequence to be displayed. 
Such specification can consist of or include an accession 
number for a single clone (e.g., a single BAG accessioned 

25 into GenBank) , wherein the starting and stopping 
nucleotides are thus absolutely identified, or 
alternatively can consist of or include an anchor or 
fulcrum point about which a chosen range of sequence is 
anchored, thus providing relative endpoints for the 

30 sequence to be displayed. For example, the user can anchor 
such a range about a given chromosomal map location, gene 
name, or even a sequence returned by query for similarity 
or identity to an input query sequence. When visual 
display 80 is used as a graphical user interface to 

35 computerized data, additional control over the first and 
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last displayed nucleotide will typically be dynamically 
selectable, as by use of standard zooming and/or selection 
tools. 

Field 81 of visual display 80 is used to present 
5 the output from process 200, that is, to present the 
bioinformatic prediction of those sequences having the 
desired function within the genomic sequence. Functional 
sequences are typically indicated by at least one rectangle 
83 (83a, 83b, B3c) , the left and right borders of which 

10 respectively indicate, by their X-axis coordinates, the 
starting and ending nucleotides of the region predicted to 
have function. 

Where a single bioinformatic method or approach 
identifies a plurality of regions having the desired 

15 function, a plurality of rectangles 83 is disposed 

horizontally in field 81. Where multiple methods and/or 
approaches are used to identify function, each such method 
and/or approach can be represented by its own series of 
horizontally disposed rectangles 83, each such horizontally 

20 disposed series of rectangles offset vertically from those 
representing the results of the other methods and 
approaches . 

Thus, rectangles 83a in FIG. 3 represent the 
functional predictions of a first method of a first 
25 approach for predicting function, rectangles 83b represent 
the functional predictions of a second method and/or second 
approach for predicting that function, and rectangles 83c 
represent the predictions of a third method and/or 
approach. 

30 Where the function desired to be identified is 

protein coding, field 81 is used to present the 
bioinformatic prediction of sequences encoding protein. 
For example, rectangles B3a can represent the results from 
GRAIL or GRAIL II, rectangles 83b can represent the results 

35 from GENEFINDER, and rectangles 83c can represent the 
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results from DICTION. 

Optionally, and preferably, rectangles 83 
collectively representing predictions of a single method 
and/or approach are identically colored and/or textured, 
5 and are distinguishable from the color and/or texture used 
for a different method and/or approach. 

Alternatively, or in addition, the color, hue, 
density, or texture of rectangles 83 can be used further to 
report a measure of the bioinf ormatic reliability of the 

10 prediction. For example, many gene prediction programs 
will report a measure of the reliability of prediction. 
Thus, increasing degrees of such reliability can be 
indicated, e.g., by increasing density of shading. Where 
display 80 is used as a graphical user interface, such 

15 measures of reliability, and indeed all other results 

output by the program, can additionally or alternatively be 
made accessible through linkage from individual rectangles 
83, as by time-delayed window ("tool tip" window), or by 
pointer (e.g., mouse) -activated link. 

20 As earlier described, increased predictive 

reliability can be achieved by requiring consensus among 
methods and/or approaches to determining function. Thus, 
field 81 can include a horizontal series of rectangles 83 
that indicate one or more degrees of consensus in 

25 predictions of function. 

Although FIG. 3 shows three series of 
horizontally disposed rectangles in field 81, display 80 
can include as few as one such series of rectangles and as 
many as can discriminably be displayed, depending upon the 

30 number of methods and/or approaches used to predict a given 
function . 

Furthermore, field 81 can be used to show 
predictions of a plurality of different functions. 
However, the increased visual complexity occasioned by such 
35 display makes more useful the ability of the user to select 
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a single function for display. When display 80 is used as 
a graphical user interface for computer query and analysis, 
such function can usefully be indicated and user- 
selectable, as by a series of graphical buttons or tabs 
5 (not shown in FIG. 3) . 

Rectangle 89 is shown in FIG. 3 as including 
interposed rectangle 84. Rectangle 84 represents the 
portion of annotated sequence for which predicted 
functional information has been assayed physically, with 

10 the starting and ending nucleotides of the assayed material 
indicated by the X axis coordinates of the left and right 
borders of rectangle 84. Rectangle 85, with optional 
inclusive circles 86 (86a, 86b, and 86c) displays the 
results of such physical assay. 

15 Although a single rectangle 84 is shown in FIG. 

3, physical assay is not limited to just one region of 
annotated genomic sequence. It is expected that an 
increasing percentage of regions predicted to have function 
by process 200 will be assayed physically, and that display 

20 80 will accordingly, for any given genomic sequence, have 
an increasing number of rectangles 84 and 85, representing 
an increased density of sequence annotation. 

Where the function desired to be identified is 
protein coding, rectangle 84 identifies the sequence of the 

25 probe used to measure expression. In embodiments of the 
present invention where expression is measured using 
genome-derived single exon microarrays, rectangle 84 
identifies the sequence included within the probe 
immobilized on the support surface of the microarray. As 

30 noted supra, such probe will often include a small amount 
of additional, synthetic, material incorporated during 
amplification and designed to permit reamplif ication of the 
probe, which sequence is typically not shown in display 80. 

Rectangle 87 is used to present the results of 

35 bioinf ormatic assay of the genomic sequence. For example, 
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where the function desired to be identified is protein 
coding, process 400 can include bioinf ormatic query of 
expression databases with the sequences predicted in 
process 200 to encode exons. And as earlier discussed, 
5 because bioinf ormatic assay presents fewer constraints than 
does physical assay, often the entire output of process 200 
can be used for such assay, without further subsetting 
thereof by process 300. Therefore, rectangle 87 typically 
need not have separate indicators therein of regions 

10 submitted for bioinf ormatic assay; that is, rectangle 87 
typically need not have regions therein analogous to 
rectangles 84 within rectangle 89. 

Rectangle 87 as shown in FIG. 3 includes smaller 
rectangles 880 and 88. Rectangles 880 indicate regions 

15 that returned a positive result in the bioinf ormatic assay, 
with rectangles 88 representing regions that did not return 
such positive results. Where the function desired to be 
predicted and displayed is protein coding, rectangles 880 
indicate regions of the predicted exons that identify 

20 sequence with significant similarity in expression 
databases, such as EST, SNP, SAGE databases, with 
rectangles 88 indicating genes novel over those identified 
in existing expression data bases. 

Rectangles 880 can further indicate, through 

25 color, shading, texture, or the like, additional 
information obtained from bioinf ormatic assay. 

For example, where the function assayed and 
displayed is protein coding, the degree of shading of 
rectangles 880 can be used to represent the degree of 

30 sequence similarity found upon query of expression 

databases. The number of levels of discrimination can be 
as few as two (identity, and similarity, where similarity 
has a user-selectable lower threshold) . Alternatively, as 
many different levels of discrimination can be indicated as 

35 can visually be discriminated. 
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Where display 80 Is used as a graphical user 
interface, rectangles 880 can additionally provide links 
directly to the sequences identified by the query of 
expression databases, and/or statistical summaries thereof. 
5 As with each of the precedlngly-discussed uses of display 
80 as a graphical user interface, it should be understood 
that the information accessed via display 80 need not be ' 
resident on the computer presenting such display, which 
often will be serving as a client, with the linked 
10 information resident on one or more remotely located 
servers . 

Rectangle 85 displays the results of physical 
assay of the sequence delimited by its left and right 
borders . 

15 Rectangle 85 can consist of a single rectangle, 

thus indicating a single assay, or alternatively, and 
increasingly typically, will consist of a series of 
rectangles (85a, 85b, 85c) indicating separate physical 
assays of the same sequence. 

20 Where the function assayed is gene expression, 

and where gene expression is assayed as herein described 
using simultaneous two-color fluorescent detection of 
hybridization to genome-derived single exon microarrays, 
individual rectangles 85 can be colored to indicate the 

25 degree of expression relative to control. Conveniently, 
shades of green can be used to depict expression in the 
sample over control values, and shades of red used to 
depict expression less than control, corresponding to the 
spectra of the Cy3 and Cy5 dyes conventionally used for 

30 respective labeling thereof. Additional functional 

information can be provided in the form of circles 8 6 (8 6a, 
86b, 86c), where the diameter of the circle can be used to 
indicate expression intensity. As discussed infra, such 
relative expression (expression ratios) and absolute 

35 expression (signal intensity) can be expressed using 
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normalized values. 

Where display 80 is used as a graphical user 
interface, rectangle 85 can be used as a link to further 
information about the assay. For example, where the assay 
5 is one for gene expression, each rectangle 85 can be used 
to link to information about the source of the hybridized 
mRNA, the identity of the control, raw or processed data 
from the microarray scan, or the like. 

FIG. 4 is rendition of display 80 representing 

10 gene prediction and gene expression for a hypothetical BAG, 
showing conventions used in the Examples presented infra. 
BAG sequence ("Ghip seq.") 89 is presented, with the 
physically assayed region thereof (corresponding to 
rectangle 84 in FIG. 3) shown in white. Algorithmic gene 

15 predictions are shown in field 81, with predictions by 

GRAIL shown, predictions by GENEFINDER, and predictions by 
DIGTION shown. Within rectangle 87, regions of sequence 
that, when used to query expression databases, return 
identical or similar sequences ("EST hit") are shown as 

20 white rectangles (corresponding to rectangles 880 in FIG. 
3), gray indicates low homology, and black indicates 
unknowns (where black and gray would correspond to 
rectangles 88 in FIG. 3). 

Although FIGS. 3 and 4 show a single stretch of 

25 sequence, uninterrupted from left to right, longer 

sequences are usefully represented by vertical stacking of 
such individual Mondrians, as shown in FIGS. 9 and 10. 



Single Exon Probes Useful For Measuring Gene Expression 

30 

The methods and apparatus of the present 
invention rapidly produce functional information from 
genomic sequence. Where the function to be identified is 
protein coding, the methods and apparatus of the present 
35 invention rapidly identify and confirm the expression of 
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portions of genomic sequence that function to encode 
protein. As a direct result, the methods and apparatus of 
the present invention rapidly yield large numbers of 
single-exon nucleic acid probes, the majority from 
5 previously unknown genes, each of which is useful for 

measuring and/or surveying expression of a specific gene in 
one or more tissues or cell types. 

It is, therefore, another aspect of the present 
invention to provide genome-derived single exon nucleic 

10 acid probes useful for gene expression analysis, and 

particularly for gene expression analysis by microarray. 

Using the methods and genome-derived single-exon 
microarrays of the present invention, we have for example 
readily identified a large number of unique ORFs from human 

15 genomic sequence. Using single exon probes that encompass 
these ORFs, we have demonstrated, through microarray 
hybridization analysis, the expression of 9,980 of these 
ORFs in heart. 

As would immediately be appreciated by one of 

20 skill in the art, each single exon probe having 

demonstrable expression in heart is currently available for 
use in measuring the level of its ORF's expression in 
heart. 

Diseases of the heart and vascular system are a 
25 significant cause of human morbidity and mortality. 
Increasingly, genetic factors are being found that 
contribute to predisposition, onset, and/or aggressiveness 
of most, if not all, of these diseases. Although mutations 
in single genes have on occasion been identified as 
30 causative, these disorders are for the most part believed 
to have polygenic etiologies. 

For example, cardiovascular disease (CVD) , which 
includes coronary heart disease, stroke, and peripheral 
arterial vascular disease, is the leading cause of death in 
35 the United States and other developed countries . In 
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developing regions, coronary heart disease and stroke are 
ranked second and third, respectively, as causes of 
mortality. In the United States alone, about 1 million 
deaths (about 42% of total deaths per year) result from CVD 
5 each year. CVD is also a significant cause of morbidity, 
with about 1.5 million people suffering myocardial 
infarction, and about 500,000 suffering strokes in the 
United States each year. With risk for CVD increasing with 
age, and an increasingly aging population, CVD will 

10 continue to be a major health problem into the future. 

CVD is caused by arterial lesions that begin as 
fatty streaks, which consist of lipid-laden foam cells, and 
develop into fibrous plaques. The atherosclerotic plaque 
may grow slowly, ' and over several decades may produce a 

15 severe stenosis or result in arterial occlusion. Some 
plaques are stable, but other, more unstable, ones may 
rupture and induce thrombosis. The thrombi may embolize, 
rapidly occluding the lumen and leading to myocardial 
infarction or acute ischemic syndrome. 

20 Risk factors for CVD include age and gender. In 

addition, a family history of CVD significantly increases 
risk, indicating a genetic basis for development of this 
disease complex. Obesity, especially truncal obesity, the 
cause of which is suspected to be genetic, is yet another 

25 risk factor for CVD. Familial disorders such as 
hyper lipidemia, hypoalphalipoproteinemia, 
hypertriglyceridemia, hypercholesterolemia, 
hyperinsulinemia, homocystinuria, and 

dysbetalipoproteinemia, all of which lead to lipid or 
30 lipoprotein abnormalities, can predispose one to the 
development of CVD. Both insulin-dependent and non- 
insulin-dependent diabetes mellitus, both of which have 
genetic components, have been also linked to the 
development of atherosclerosis. 
35 The literature is replete with evidence for 
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genetic causes of cardiovascular diseases. For example, 
studies by Allayee et al.. Am. J. Hum. Genet. 63:577- 
585(1998), indicated a genetic association between familial 
combined hyperlipidemia (FCHL) and small dense LDL 
5 particles. The studies also concluded that the genetic 
determinants for LDL particle size are shared, at least in 
part, among FCHL families and the more general population 
at risk for CVD. Juo et al . , Am. J. Hum. Genet. 63: 58 6- 
594 (1998) demonstrated that small, dense LDL particles and 

10 elevated apolipoprotein B levels, both of which are 

commonly found in members of FCHL families, share a common 
major gene plus individual polygenic components. 

The common major gene was estimated to explain 37% of 
the variants of adjusted LDL particle size and 23% of the 

15 variants of adjusted apoB levels. 

The atherogenic lipoprotein phenotype (ALP) is a 
common heritable trait, symptoms of which include a 
prevalence of small, dense LDL particles, increased levels 
of triglyceride-rich lipoproteins, reduced levels of high 

20 density lipoprotein, and increased risk of CVD, 

particularly myocardial infarction. Both Nishina et al . , 
Proc. Nat. Acad. Sci. 89: 708-712 (1992) and Rotter et al . , 
Am. J. Hum. Genet. 58: 585-594(1996) demonstrated linkage 
between ALP and the LDLR locus. Rotter et al . , supra, also 

25 reported linkage to the CETP locus on chromosome 16 and to 
the SODl locus on chromosome 6, and possibly also to the 
APOA1/APOC3/APOA4 cluster on chromosome 11. 

Mutations in genes identified as components of 
lipid metabolism, e.g., apolipoprotein E (apoE) and LDL 

30 receptor (LDLR) , have been shown to be associated with 
predisposition to the development of CVD. For example, 
several apoE variants had been found to be associated with 
familial dysbetalipoproteinemia, characterized by elevated 
plasma cholesterol and triglyceride levels and an increased 

35 risk for atherosclerosis (de Knijff et al . , Mutat 4: 178- 
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194 (1994)). Mutations in the LDLR gene have been 
associated with the familial hypercholesterolemia, an 
autosomal dominant disorder characterized by elevation of 
serum cholesterol bound to low density lipoprotein (LDL) , 
5 that can lead to increased susceptibility to CVD. 

To date, mutations in numerous genes have been 
shown to be associated with increased CVD susceptibility. 
However, the identified genetic associations are believed 
not to account for all genetic contributions to CVD. 

10 As yet another example, hypertension is a major 

health problem because of its high prevalence and its 
association with increased risk of CVD. Approximately 25% 
of all adults and over 60% of persons older than 60 years 
in the United States have high blood pressure. 

15 Arterial or systemic hypertension is diagnosed 

when the average of two or more diastolic BP measurements 
on at least two subsequent visits is 90 mm Hg or more, or 
when the average of multiple systolic BP readings on two or 
more subsequent visits is consistently greater than 140 mm 

20 Hg. Pulmonary hypertension is defined as pressure within 
the pulmonary arterial system elevated above the normal 
range; pulmonary hypertension may lead to right ventricle 
(RV) failure. 

Hypertension, together with other cardiovascular 

25 risk factors, leads to atherosclerosis and other forms of 
CVD, primarily by damaging the vascular endothelium. In 
more than 40% of the U.S. population, hypertension is 
accompanied by hyperlipidemia and leads to the development 
of atherosclerotic plaques. In the absence of 

30 hyperlipidemia, intimal thickening occurs. Non- 
atherosclerotic hypertension-induced vascular damage can 
lead to stroke or heart failure. 

Familial diseases associated with secondary 
hypertension include familial renal disease, polycystic 

35 kidney disease, medullary thyroid cancer, pheochromocytoma, 
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and hyperparathyroidism. Hypertension is also twice as 
common in patients with diabetes mellitus. 

More than 95% of all hypertension cases are 
essential hypertension, that is, lack identifiable 
5 antecedent clinical cause. Essential hypertension shows 
clustering in families and can result from a variety of 
genetic diseases. In most cases, high blood pressure 
results from a complex interaction of factors with both " 
genetic and environmental components. The recent search 

10 for genes that contribute to the development of essential 
hypertension has shown that the disorder is polygenic in 
origin. However, with several exceptions (such as 
angiotensinogen, angiotensin receptor-1, beta-3 subunit of 
guanine nucleotide-binding protein, tumor necrosis factor 

15 ■ receptor-2, and "-adducin) , the particular genes involved 
are still being sought. 

Susceptibility loci for essential hypertension 
have been mapped to chromosomes 17 and 15q. Hasstedt et 
al., Am. J. Hum. Genet. 43: 14-22 (1988) measured red cell 

20 sodium in 1,800 normotensive members of 16 Utah pedigrees 
ascertained through hypertensive or normotensive probands, 
siblings with early stroke death, or brothers with early 
coronary disease, and suggested that red blood cell sodium 
was determined by 4 alleles at a single locus. This major 

25 locus was thought to explain 29% of the variance in red 
cell sodium, and polygenic inheritance explained another 
54.6%. A higher frequency of the high red blood cell sodium 
genotype in pedigrees in which the proband was hypertensive 
rather than normotensive provided evidence that this major 

30 locus increases susceptibility to hypertension. 

From a study of systolic blood pressure in 278 
pedigrees, Perusse et al . , Am. J. Hum. Genet. 49: 94-105 
(1991) reported that variability in systolic blood pressure 
is likely influenced by allelic variation of a single gene, 

35 with gender and age dependence. They also suggested that a 
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single gene may be associated with a steeper increase of 
blood pressure with age among males and females. 

There is strong evidence, however, for additional 
as yet uncharacterized, hypertension-associated loci on 
5 other chromosomes. 

For example, Xu et al.. Am. J. Hum. Genet. 64: 
1694-1701 (1999) carried out a systematic search for 
chromosomal regions containing genes that regulate blood 
pressure by scanning the entire autosomal genome using 367 

10 polymorphic markers. Because of the sampling design, the 
number of sib pairs, and the availability of genotyped 
parents, this study represented one of the most powerful of 
its kind. Although no regions achieved a 5% genomewide 
significance level, maximum lod scores were greater than 

15 2.0 for regions of chromosomes 3, 11, 15, 16, and 17. 

As another example, cardiac arrhythmias account 
for several thousand deaths each year. Arrhythmias such as 
ventricular fibrillation, which causes more than 300,000 
sudden deaths annually in the United States alone, 

20 encompass a multitude of disorders. Another type of 

arrhythmia, idiopathic dilated cardiomyopathy, of which 
familial dilated cardiomyopathy accounts for 20-25%, is 
responsible for more than 10,000 deaths in the United 
States annually and is the predominant indication for 

25 cardiac transplantation. 

Cardiac arrhythmias can be divided into 
bradyarrhythmias (slowed rhythms) or tachyarrhythmias 
(speeded rhythms) . Bradyarrhythmias result from 
abnormalities of intrinsic automatic behavior or 

30 conduction, primarily within the atrioventricular node and 
the His-Purkin j e ' s network. Tachyarrhythmias are caused by 
altered automaticity, reentry, or triggered automaticity . 

Bradyarrhythmias arising from suspected polygenic 
disorders include Long QT syndrome 4, atrioventricular 

35 block, familial sinus node disease, progressive cardiac 
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conduction defect, and familial cardiomyopathy. 
Tachyarrhythmias with possible underlying polygenic causes 
include familial ventricular tachycardia, Wolff-Parkinson- 
White syndrome, familial arrhythmogenic right ventricular 
5 dysplasia, heart-hand syndrome V, Mai de Meleda, familial 
ventricular fibrillation, and familial noncompaction of 
left ventricular myocardium. 

For some of the arrhythmias, one or more of the 
causative genes have been identified. 

10 For example, atrioventricular block has been 

associated with mutations in the SCN5A gene, as well as 
mutations in a locus mapped to 19ql3. Studies have shown 
linkage of familial sinus node disease to a marker on 
10q22-q24. Familial ventricular tachycardia has been 

15 linked to mutations in genes encoding the G protein subunit 
alpha-i2 (GNAIl) , and/or related genes. Examination of 
families with Wolff-Parkinson-White syndrome suggest an 
autosomal dominant pattern of inheritance and evidence of 
linkage of the disorder to DNA markers on band 7q3. 

20 Linkage analysis shows strong evidence for localization of 
a gene for Mai de Meleda disease on 8qter. Familial 
ventricular fibrillation can be caused by mutations in the 
cardiac sodium channel gene SCN5A. Familial noncompaction 
of left ventricular myocardiurr. has been linked to mutations 

25 in the gene encoding tafazzin (TAZ) , or in the FK506- 
binding protein lA gene (FKBPIA) . 

Familial dilated cardiomyopathy is characterized 
by an autosomal dominant pattern of inheritance with age- 
related penetrance. The linkage of familial dilated 

30 cardiomyopathy to several loci indicate that it is 

polygenic. These loci include CMDIA on Ipll-qll, CMDIB on 
9ql3, CMDIC on 10q21, CMDID on lq32, CMDIE on 3p, CMDlF on 
6q, CMDIG on 2q31, CMDIH on 2ql4-q22, and CMDlI, which 
results from mutation in the DES gene on 2q35. 

35 In addition, cardiomyopathy can also be caused by 
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mutations in the ACTC gene, the cardiac beta-myosin heavy 
chain gene (MYH7), or the cardiac troponin T gene. 

Familial arrhythmogenic right ventricular 
dysplasia is inherited as an autosomal dominant with 
5 reduced penetrance and is one of the major genetic causes 
of juvenile sudden death. It is estimated that the 
prevalence of familial arrhythmogenic right ventricular 
dysplasia ranges from 6 per 10,000 in the general 
population to 4.4 per 1,000 in some areas. 

10 Several loci for familial arrhythmogenic right 

ventricular dysplasia have been mapped indicating that this 
disease is al,so polygenic in nature. These loci include 
ARVDl on 14q23-q24, ARVD2" on Iq42-q43, ARVD3 on 14ql2-q22, 
ARVD4 on 2q32.1-q32.3, ARVD5 on 3p23, and ARVD6 on 10pl4- 

15 pl2. 

Progressive cardiac conduction defect (PCCD) , 
also called Lenegre-Lev disease, is one of the most common 
cardiac conduction diseases. It is characterized by 
progressive alteration of cardiac conduction through the 

20 His-Purkinje system with right or left bundle branch block 
and widening of QRS complexes, leading to complete 
atrioventricular block and ultimately causing syncope and 
sudden death. It represents the major cause of pacemaker 
implantation in the world (0.15 implantations per 1,000 

25 inhabitants per year in developed countries) . The cause of 
PCCD is unknown but familial cases with right bundle branch 
block have been reported suggesting that at least some 
cases are of genetic origin. Reports have linked PCCD to 
HBl on 19ql3.3, and to mutations in the SCN5A gene (Schott 

30 et al., Nature Genet. 23: 20-21 (1999)). 

As yet a further example, congenital heart 
disease occurs at a rate of 8 per 1000 live births, which 
corresponds to approximately 32,000 infants with newly 
diagnosed congenital heart disease each year in the United 

35 States. Twenty percent of infants with congenital heart 
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disease die within the first year of life. Approximately 
80% of the first-year survivors live to reach adulthood. 
Congenital heart disease also has economic impact due to 
the estimated 20,000 surgical procedures performed to 
5 correct circulatory defects in these patients. The 

estimated number of adults with congenital heart disease in 
the United States is currently about 900,000. 

In 90% of patients, congenital heart disease is 
attributable to multifactorial inheritance. Only 5-10% of 

10 malformations are due to primary genetic factors, which are 
either chromosomal or a result of a single mutant gene. 

The most common congenital heart disease found in 
adults is bicuspid aortic valve. This defect occurs in 2% 
of the general population and accounts for approximately 

15 50% of operated cases of aortic stenosis in adults. Atrial 
septal defect is responsible for 30-40% of congenital heart 
disease seen in adults. The most common congenital cardiac 
defect observed in the pediatric population is ventricular 
septal defect, which accounts for 15-20% of all congenital 

20 lesions. Tetralogy of Fallot is the most common cyanotic 
congenital anomaly observed in adults. Other congenital 
heart diseases include Eisenmenger ' s syndrome, patent 
ductus arteriosus, pulmonary stenosis, coarctation of the 
aorta, transposition of the great arteries, tricuspid 

25 atresia, univentricular heart, Ebstein's anomaly, and 
double-outlet right ventricle. 

A n\imber of studies have identified putative 
genetic loci associated with one or more congenital heart 
diseases . 

30 Congenital heart disease affects more than 40% of 

all Down syndrome patients. The candidate chromosomal 
region containing the putative gene or genes for congenital 
heart disease associated with Down syndrome is 21q22.2- 
q22.3, between ETS2 and MXl. 

35 DiGeorge syndrome (DGS) is characterized by 
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several symptoms including outflow tract defects of the 
heart such as teratology of Fallot. Most cases result from 
a deletion of chromosome 22qll.2 (the DiGeorge syndrome 
chromosome region, or DGCR) . The 22qll deletion is the 
5 second most common cause of congenital heart disease after 
Down syndrome. Several genes are lost in this deletion 
including the putative transcription factor TUPLEl. This 
deletion is associated with a variety -of phenotypes, e.g., 
Shprintzen syndrome; conotruncal anomaly face (or Takao 

10 syndrome) ; and isolated outflow tract defects of the heart 
including Tetralogy of Fallot, truncus arteriosus, and 
interrupted aortic arch. 

Whereas 90% of case-s of DGS may now be attributed 
to a 22qll deletion, other associated chromosome defects 

15 have been identified. For example, Greenberg et al . , Am. 
J. Hum. Genet. 43:605-611 (1988), reported 1 case of DGS 
with dellOplB and one with a 18q21.33 deletion. Fukushima 
et al.. Am. J. Hum. Genet. 51 (suppl.):A80 (1992) reported 
linkage with a deletion of 4q21.3-q25. Gottlieb et al., 

20 Am. J. Hum, Genet. 62: 495-498 (1998) concluded that the 
deletion of more than 1 region on lOp could be associated 
with the DGS phenotype. The association of the DiGeorge 
syndrome with at least 2 and possibly more chromosomal 
locations suggests strongly the involvement of several 

25 genes in this disease. 

Digilio et al., J. Med. Genet. 34: 188-190 
(1997), calculated empiric risk figures for recurrence of 
isolated Tetralogy of Fallot in families after exclusion of 
del(22qll), and concluded that gene(s) different from those 

30 located on 22qll must be involved in causing familial 

aggregation of nonsyndromic Tetralogy of Fallot. Johnson 
et al.. Am. J. Med. Genet. (1997) conducted a cytogenetic 
evaluation of 159 cases of Tetralogy of Fallot. They 
reported that a del(22qll) was identified in 14% who 

35 underwent fluorescence in situ hybridization (FISH) testing 
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,,with the N25 cosmid probe. 

Other congenital heart disease are also suspected 
to b^i of polygenic origin. For example. Holmes et al . , 
Birth Defects Orig. Art. Ser. X(4): 228-230 (1974) 
5 described familial clustering of hypoplastic left heart 
syndrome in siblings consistent with multifactorial 
causation. 

Other significant diseases of the heart and 
vascular system are also believed to have a genetic, 

10 typically polygenic, etiological component. These diseases 
include, for example, hypoplastic left heart syndrome, 
cardiac valvular dysplasia, Pfeiffer cardiocranial 
syndrome, oculof aciocardiodental syndrome, Kapur-Toriello 
syndrome, Sonoda syndrome, Ohdo Blepharophimosis syndrome, 

15 heart-hand syndrome, Pierre-Robin syndrome, Hirschsprung 
disease, Kousseff syndrome. Grange occlusive arterial 
syndrome, Kearns-Sayre syndrome, Kartagener syndrome, 
Alagille syndrome, Ritscher-Schinzel syndrome, Ivemark 
syndrome, Young-Simpson syndrome, hemochromatosis, 

20 Holzgreve syndrome, Barth syndrome, Smith-Lemli-Opitz 

syndrome, glycogen storage disease, Gaucher-like disease, 
Fabry disease, Lov/ry-Maclean syndrome, Rett syndrome, Opitz 
syndrome, Marfan syndrome, Miller-Dieker lissencephaly 
syndrome, mucopolysaccharidosis, Bruada syndrome, 

25 humerospinal dysostosis, Phaver syndrome, McDonough 
syndrome, Marfanoid hypermobility syndrome, 
atransf errinemia, Cornelia de Lange syndrome. Leopard 
syndrome, Diamond-Blackf an anemia, Steinfeld syndrome, 
progeria, and Williams-Beuren syndrome. 

30 The human genome-derived single exon nucleic acid 

probes and microarrays of the present invention are useful 
for predicting, diagnosing, grading, staging, monitoring 
and prognosing diseases of human heart and vascular system, 
particularly those diseases with polygenic etiology. With 

35 each of the single exon probes described herein shown to be 
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expressed at detectable levels in human heart, and with 
about 2/3 of the probes identifying novel genes, the single 
exon microarrays of the present invention provide 
exceptionally high informational content for such studies. 
5 For example, diagnosis (including differential 

diagnosis among clinically Indistinguishable disorders), 
staging, and/or grading of a disease can be based upon the 
quantitative relatedness of a patient gene expression 
profile to one or more reference expression profiles known 

10 to be characteristic of a given heart or vascular disease, 
or to specific grades or stages thereof. 

In one embodiment, the patient gene expression 
profile is generated by hybridizing nucleic acids obtained 
directly or indirectly from transcripts expressed in the 

15 patient's heart or vascular tissues to the genome-derived 
single exon microarray of the present invention. Reference 
profiles are obtained similarly by hybridizing nucleic 
acids obtained directly or indirectly from transcripts 
expressed in heart or vascular tissue of individuals with 

20 known disease. Methods for quantitatively relating gene 
expression profiles, without regard to the function of the 
protein encoded by the gene, are disclosed in WO 99/58720, 
incorporated herein by reference in its entirety. 

In another approach, the genome-derived single 

25 exon probes and microarrays of the present invention can be 
used to interrogate genomic DNA, rather than pools of 
expressed message; this latter approach permits 
predisposition to and/or prognosis of heart or vascular 
disease to be assessed through the massively parallel 

30 determination of altered copy number, deletion, or mutation 
in the patient's genome of exons known to be expressed in 
human heart. The algorithms set forth in WO 99/58720 can 
be applied to such genomic profiles without regard to the 
function of the protein encoded by the interrogated gene. 

35 The utility is specific to the probe; at 
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sufficiently high hybridization stringency, which 
stringencies are well known in the art — see Ausubel et al . 
and Maniatis et al. — each probe reports the level of 
expression of message specifically containing that ORF. 



probes of the present invention, for which expression in 
the heart has been demonstrated are useful for both 
measurement in the heart and for survey of expression in 
other tissues. 



of probes for novel genes. 

As mentioned above and further detailed in 
Examples 1 and 2, the methods described enable ORFs which 
are not present in existing expression databases to be 

15 identified. And the fewer the number of tissues in which 
the ORF can be shown to be expressed, the more likely the 
ORF will prove to be part of a novel gene: as further 
discussed in Example 2, ORFs whose expression was 
measurable in only a single of the tested tissues were 

20 represented in existing expression databases at a rate of 
only 11%, whereas 36% of ORFs whose expression was 
measurable in 9 tissues were present in existing expression 
databases, and fully 45% of those ORFs expressed in all ten 
tested tissues were present in existing expressed sequence 

25 databases. 

Either as tools for measuring gene expression or 
tools for surveying gene expression, the genome-derived 
single exon probes of the present invention have 
significant advantages over the cDNA or EST-based probes 

30 that are currently available for achieving these utilities. 

The genome-derived single exon probes of the 
present invention are useful in constructing genome-derived 
single exon microarrays / the genome-derived single exon 
microarrays, in turn, are useful devices for. measuring and 

35 for surveying gene expression in the human. 
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It should be appreciated, however, that the 
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Significant among such advantages is the presence 
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Gene expression analysis using microarrays — 
conventionally using microarrays having probes derived from 
expressed message — is well-established as useful in the 
biological research arts (see Lockhart et al. Nature 405, 
5 827-836) . 

Microarrays have been used to determine gene 
expression profiles in cells in response to drug treatment 
(see, for example, Kaminski et al., "Global Analysis of 
Gene Expression in Pulmonary Fibrosis Reveals Distinct 

10 Programs Regulating Lung Inflammation and Fibrosis," Proc, 
Natl. Acad. Sci. USA 97 ( 4 ): 1778-83 (2000); Bartosiewicz et 
al., "Development of a Toxicological Gene Array and 
Quantitative Assessment of This Technology," Arch. Biochem. 
Biophys. 376(1): 66-73 (2000)), viral infection (see for 

15 example, Geiss et al., "Large-scale Monitoring of Host Cell 
Gene Expression During HIV-1 Infection Using cDNA 
Microarrays," Virology 266(1): 8-15 (2000)) and during cell 
processes such as dif f erentiation, senescence and apoptosis 
(see, for example, She! ton et a J . , "Microarray Analysis of 

20 Replicative Senescence," Curr. Biol. 9(17): 939-45 (1999); 
Voehringer et ai . , "Gene Microarray Identification of Redox 
and Mitochondrial Elements That Control Resistance or 
Sensitivity to Apoptosis," Proc. Natl. Acad. Sci. USA 
97 (6) :2680-5 (2000) ) . 

25 Microarrays have also been used to determine 

abnormal gene expression in diseased tissues (see, for 
example, Alon et al., "Broad Patterns of Gene Expression 
Revealed by Clustering Analysis of Tumor and Normal Colon 
Tissues Probed by Oligonucleotide Arrays," Proc. Natl. 

30 Acad. Sci. USA 96 (12) : 5745-50 (1999); Perou et al . , 

"Distinctive Gene Expression Patterns in Human Mammary 
Epithelial Cells and Breast Cancers, Proc. Natl. Acad. Sci. 
USA 96 (16) : 9212-7 (1999); Wang et ai., "Identification of 
Genes Differentially Over-expressed in Lung Squamous Cell 

35 Carcinoma Using Combination of cDNA Subtraction and 
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Microarray Analysis, " Oncog-ene 19 ( 12 ): 1519-28 (2000); 
Whitney et al., "Analysis of Gene Expression in Multiple 
Sclerosis Lesions Using cDNA Microarrays , " Ann. Neurol. 
46(3): 425-8 (1999)), in drug discovery screens (see, for 
5 example, Scherf et al . , "A Gene Expression Database for the 
Molecular Pharmacology of Cancer," Nat. Genet. 24(3):236-44 
(2000)) and in diagnosis to determine appropriate treatment 
strategies (see, for example, Sgroi et al., "In vivo Gene 
Expression Profile Analysis of Human Breast Cancer 

10 Progression," Cancer Res. 59 (22 ): 5656-61 (1999)). 

In microarray-based gene expression screens of 
pharmacological drug candidates upon cells, each probe 
provides specific useful data. In particular, it should be 
appreciated that even those probes that show no change in 

15 expression are as informative as those that do change, 
serving, in essence, as negative controls. 

For example, where gene expression anaJ.ysis is 
used to assess toxicity of chemical agents on cells, the 
failure of the agent to change a gene's expression level is 

20 evidence that the drug likely does not affect the pathway 
of which the gene's expressed protein is a part. 
Analogously, where gene expression analysis is used to 
assess side effects of pharmacological agents - whether in 
lead compound discovery or in subsequent screening of lead 

25 compound derivatives - the inability of the agent to alter 
a gene's expression level is evidence that the drug does 
not affect the pathway of which the gene's expressed 
protein is a part. 

WO 99/58720 provides methods for quantifying the 

30 relatedness of a first and second gene expression profile 
and for ordering the relatedness of a plurality of gene 
expression profiles. The methods so described permit 
useful information to be extracted from a greater 
percentage of the individual gene expression measurements 

35 from a microarray than methods previously used in the art. 
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Other uses of microarrays are described in 
Gerhold et al.. Trends Biochem. Sci. 24 (5) : 168-173 (1999) 
and Zweiger, Trends Blotechnol. 17 ( 11 ): 42 9-436 (1999); 
Schena et al . 

5 The invention particularly provides genome- 

derived single-exon probes known to be expressed in heart. 

The individual single axon probes can be provided 
in the form of substantially isolated and purified nucleic 
acid, typically, but not necessarily, in a quantity 

10 sufficient to perform a hybridization reaction. 

Such nucleic acid can be in any form directly 
hybridizable to the message that contains the probe's ORF, 
such as double stranded DNA, single-stranded DNA 
complementary to the message, single-stranded RNA 

15 complementary to the message, or chimeric DNA/RNA molecules 
so hybridizable. The nucleic acid can alternatively or 
additionally include either nonnative nucleotides, 
alternative internucleotide linkages, or both, so long as 
complementary binding can be obtained. For example, probes 

20 can include phosphorothioates, methylphosphonates, 

morpholino analogs, and peptide nucleic acids (PNA), as are 
described, for example, in U.S. Patent Nos . 5,142,047; 
5,235,033; 5,166,315; 5,217,866; 5,184,444; 5,861,250. 

Usefully, however, such probes are provided in a 

25 form and quantity suitable for amplification, where the 
amplified product is thereafter to be used in the 
hybridization reactions that probe gene expression. 
Typically, such probes are provided in a form and quantity 
suitable for amplification by PGR or by other well known 

30 amplification technique. One such technique additional to 
PGR is rolling circle amplification, as is described, inter 
alia, in U.S. Patent Nos. 5,854,033 and 5,714,320 and 
international patent publications WO 97/19193 and 
WO 00/15779, As is well understood, where the probes are 

35 to be provided in a form suitable for amplification, the 
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range of nucleic acid analogues and/or internucleotide 
linkages will be constrained by the requirements and nature 
of the amplification enzyme. 

Where the probe is to be provided in form 
5 suitable for amplification, the quantity need not be 
sufficient for direct hybridization for gene expression 
analysis, and need be sufficient only to function as an 
amplification template, typically at least about 1, _ 10 or 
100 pg or more. 

10 Each discrete amplifiable probe can also be 

packaged with amplification primers, either in a single 
composition that comprises probe template and primers, or 
in a kit that comprises such primers separately packaged 
therefrom. As earlier mentioned, the ORF-specific 

15 5' primers used for genomic amplification can have a first 
common sequence added thereto, and the ORF-specific 3' 
primers used for genomic amplification can have a second, 
different, common sequence added thereto, thus permitting, 
in this embodiment, the use of a single set of 5' and 3' 

20 primers to amplify any one of the probes. The probe 

composition and/or kit can also include buffers, enzyme, 
etc., required' to effect amplification. 

As mentioned earlier, when intended for use on a 
genome-derived single exon microarray of the present 

25 invention, the genome-derived single exon probes of the 
present invention will typically average at least about 
100, 200, 300, 400 or 500 bp in length, including (and 
typically, but not necessarily centered about) the ORF. 
Furthermore, when intended for use on a genome-derived 

30 single exon microarray of the present invention, the 

genome-derived single exon probes of the present invention 
will typically not contain a detectable label. 

When intended for use in solution phase 
hybridization, however — that is, for use in a 

35 hybridization reaction in which the probe is not first 
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bound to a support substrate (although the target may 
indeed be so bound) — length constraints that are imposed 
in microarray-based hybridization approaches will be 
relaxed, and such probes will typically be labeled. 
5 In such case, the only functional constraint that 

dictates the minimum size of such probe is that each such 
probe must be capable of specifically identifying in a 
hybridization reaction the exon from which it is drawn. In 
theory, a probe of as little as 17 nucleotides is capable 

10 of uniquely identifying its cognate sequence in the human 
genome. For hybridization to expressed message — a subset 
of target sequence that is much reduced in complexity as 
compared to genomic sequence — even fewer nucleotides are 
required for specificity. 

15 Therefore, the probes of the present invention 

can include as few as 20, 25 or 50 bp or ORF, or more. In 
particular embodiments, the ORF sequences are given in SEQ 
ID NOS. 9,981 - 19,771, respectively, for probe SEQ ID NOS . 
1 - 9,980. The minimum amount of ORF required to be 

20 included in the probe of the present invention in order to 
provide specific signal in either solution phase or 
microarray-based hybridizations can readily be determined 
for each of ORF SEQ ID NOS. 9,981 - 19,771 individually by 
routine experimentation using standard high stringency 

25 conditions. 

Such high stringency conditions are described, 
inter alia, in Ausubel et al. and Maniatis et al. For 
microarray-based hybridization, standard high stringency 
conditions can usefully be 50% formamide, 5X SSC, 0.2 pg/jal 

30 poly(dA), 0.2 pg/pl human Cotl DNA, and 0.5 % SDS, in a 

humid oven at 42°C overnight, followed by successive washes 
of the microarray in IX SSC, 0.2% SDS at 55°C for 5 
minutes, and then O.IX SSC, 0.2% SDS, at 55°C for 20 
minutes. For solution phase hybridization, standard high 

35 stringency conditions can usefully be aqueous hybridization 
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at 65°C in 5X SSC. Lower stringency conditions, suitable 
for cross-hybridization to itiRNA encoding structurally- and 
functionally-related proteins, can usefully be the same as 
the high stringency conditions but with reduction in 
5 temperature for hybridization and washing to room 
temperature (approximately 25°C) . 

When intended for use in solution phase 
hybridization, the maximum size of the single exon probes 
of the present invention is dictated by the proximity of 
10 other expressed exons in genomic DNA: although each single 
exon probe can include intergenic and/or intronic material 
contiguous to the ORF in the human genome, each probe of 
the present invention will include portions of only one 
expressed exon. 

15 Thus, each single exon probe will include no more 

than about 25 kb of contiguous genomic sequence, more 
typically no more than about 20 kb of contiguous genomic 
sequence, more usually no more than about 15 kb, even more 
usually no more than about 10 kb. Usually, probes that are 

20 maximally about 5 kb will be used, more typically no more 
than about 3 kb. 

It will be appreciated that the Sequence Listing 
appended hereto presents, by convention, only that strand 
of the probe and ORF sequence that can be directly 

25 translated reading from 5' to 3' end. As would be well 
understood by one of skill in the art, single stranded 
probes must be complementary in sequence to the ORF as 
present in an mRNA; it is well within the skill in the art 
to determine such complementary sequence. It will further 

30 be understood that double stranded probes can be used in 
both solution-phase hybridization and microarray-based 
hybridization if suitably denatured. 

Thus, it is an aspect of the present invention to 
provide single-stranded nucleic acid probes that have 

35 sequence complementary to those described herein above and 
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below, and double-stranded probes one strand of which has 
sequence complementary to the probes described herein. 

The probes can, but need not, contain intergenic 
and/or intronic material that flanks the ORF, on one or 
5 both sides, in the same linear relationship to the ORF that 
the intergenic and/or intronic material bears to the ORF in 
genomic DNA. The probes do not, however, contain nucleic 
acid derived from more than one expressed ORF. 

And when intended for use in solution 
10 hybridization, the probes of the present invention can 
' usefully have detectable labels. Nucleic acid labels are 
well known in the art, and include,' inter alia, radioactive 
labels, such as ^H, ^^p^ 33p^ 353^ 125-^^ i3ij. fluorescent 
labels, such as Cy3, Cy5, Cy5.5, Cy7, SYBR® 
15 Green and other labels described in Haugland, 

Handbook of Fluorescent Probes and Research Chemicals, 7th 
ed.. Molecular Probes Inc., Eugene, OR (2000), or 
fluorescence resonance energy transfer tandem conjugates 
thereof; labels suitable for chemiluminescent and/or 
20 enhanced chemiluminescent detection; labels suitable for 
ESR and NMR detection; and labels that include one member 
of a specific binding pair, such as biotin, digoxigenin, or 
the like. 

The probes, either in quantity sufficient for 
25 hybridization or sufficient for amplification, can be 
provided in individual vials or containers. 

Alternatively, such probes can usefully be 
packaged as a plurality of such individual genome-derived 
single exon probes. 
30 When provided as a collection of plural 

individual probes, the probes are typically made available 
in amplifiable form in a spatially-addressable ordered set, 
typically one per well of a microtiter dish. Although a 96 
well microtiter plate can be used, greater efficiency is 
35 obtained using higher density arrays. 
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If, as earlier mentioned, the ORF-specific 
5' primers used for genomic amplification had a first 
common sequence added thereto, and the ORF-specific 3' 
primers used for genomic amplification had a second, 
5 different, common sequence added thereto, a single set of 
5 ' and 3 ' primers can be used to amplify all of the probes 
from the amplifiable ordered set. 

Such collections of genome-derived single exon 
probes can usefully include a plurality of probes chosen 
10 for the common attribute of expression in the human heart. 

In such defined subsets, typically at least 50, 
60, 75, 80, 85, 90 or 95% or more of the probes will be 
chosen by their expression in the defined tissue or cell 
type. 

15 The single exon probes of the present invention, 

as well as fragments of the single exon probes comprising 
selectively hybridizable portions of the probe ORF, can be 
used to obtain the full length cDNA that includes the ORF 
by (i) screening of cDNA libraries; (ii) rapid 

20 amplification of cDNA ends ("ElACE"); or (iii) other 
conventional means, as are described, inter alia, in 
Ausubel et al. and Maniatis et al. 

It is another aspect of the present invention to 
provide genome-derived single exon nucleic acid microarrays 

25 useful for gene expression analysis, where the term 

"microarray" has the meaning given in the definitional 
section of this description, supra. ■ 

The invention particularly, provides genome- 
derived single-exon nucleic acid microarrays comprising a 

30 plurality of probes known to be expressed in human heart. 
In preferred embodiments, the present invention provides 
human genome-derived single exon microarrays comprising a 
plurality of probes drawn from the group consisting of SEQ 
ID NOS. : 1 - 9, 980. 

35 When used for gene expression analysis, the 
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genome-derived single exon microarrays provide greater 
physical informational density than do the genome-derived 
single exon microarrays that have lower percentages of 
probes known to be expressed commonly in the tested tissue. 
5 At a fixed probe density, for example, a given microarray 
surface area of the defined subset genome-derived single 
exon microarray can yield a greater number of expression 
measurements. Alternatively, at a given probe density, the 
same number of expression measurements can be obtained from 

10 a smaller substrate surface area. Alternatively, at a 

fixed probe density and fixed surface area, probes can be 
provided redundantly, providing greater reliability in 
signal measurement for any given probe. Furthermore, with 
a higher percentage of probes known to be expressed in the 

15 assayed tissue, the dynamic range of the detection means 

can be adjusted to reveal finer levels discrimination among 
the levels of expression. 

Although particularly described with respect to 
their utility as probes of gene expression, particularly as 

20 probes to be included on a genome-derived single exon 

microarray, each of the nucleic acids having SEQ ID NOS.: 1 
- 9,980 contains an open-reading frame, set forth 
respectively in SEQ ID NOS.: 9,981 - 19,771, that encodes a 
protein domain. Thus, each of SEQ ID NOS. 1 - 9,98 0 can be 

25 used, or that portion thereof in SEQ ID NOS. 9,981 - 19,771 
used, to express a protein domain by standard in vitro 
recombinant techniques. See Ausubel et al . and Maniatis et 
al. 

Additionally, kits are available commercially 
30 that readily permit such nucleic acids to be expressed as 
protein in bacterial cells, insect cells, or mammalian 
cells, as desired (e.g., HAT™ Protein Expression & 
Purification System, ClonTech Laboratories, Palo Alto, CA; 
Adeno-X™ Expression System, ClonTech Laboratories, Palo 
35 Alto, CA; Protein Fusion & Purification (pMAL™) System, New 
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England Biolabs, Beverley, MA) 

Furthermore, shorter peptides can be chemically 
synthesized using commercial peptide synthesizing equipment 
and well known techniques. Procedures are described. Inter 
5 alia, in Chan et al. (eds.), Fmoc Solid Phase Peptide 

Synthesis: A Practical Approach (Practical Approach Series, 
(Paper)), Oxford Univ. Press (March 2000) (ISBN: 
0199637245) ; Jones, Amino Acid and Peptide Synthesis 
(Oxford Chemistry Primers, No 7) , Oxford Univ. Press 
10 (August 1992) (ISBN: 0198556683); and Bodanszky, Principles 
of Peptide Synthesis (Springer Laboratory) , Springer Verlag 
(December 1993) (ISBN: 0387564314). 

It is, therefore, another aspect of the invention 
to provide peptides comprising an amino acid sequence 
15 translated from SEQ ID NOS . : 9,981 - 19,771. Such amino 

acid sequences are set out in SEQ ID NOS: 19,772 - 29,119. 
Any such recombinantly-expressed or synthesized peptide of 
at least 8, and preferably at least about 15, amino acids, 
can be conjugated to a carrier protein and used to generate 
20 antibody that recognizes the peptide. Thus, it is a 

further aspect of the invention to provide peptides that 
have at least 8, preferably at least 15, consecutive amino 
acids . 

25 The following examples are offered by way of 

illustration and not by way of limitation. 

EXAMPLE 1 

Preparation of Single Exon Microarrays from ORFs Predicted 
30 in Human Genomic Sequence 

Bioinf ormatlcs Results 

All human BAG sequences in fewer than 10 pieces 
that had been accessioned in a five month period 
35 immediately preceding this study were downloaded from 
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GenBank. This corresponds to -2200 clones, totaling -350 
MB of sequence, or approximately 10% of the human genome. 

After masking repetitive elements using the 
program CROSS_MATCH, the sequence was analyzed for open 
5 reading frames using three separate gene finding programs. 
The three programs predict genes using independent 
algorithmic methods developed on independent training sets: 
GRAIL uses a neural network, GENEFINDER uses a hidden 
Markoff model, and DICTION, a program proprietary to 
10 Genetics Institute, operates according to a different 

heuristic. The results of all three programs were used to 
create a prediction matrix across the segment of genomic 
DNA. 

The three gene finding programs yielded a range 
15 of results. GRAIL identified the greatest percentage of 
genomic sequence as putative coding region, 2% of the data 
analyzed. GENEFINDER was second, calling 1%, and DICTION 
yielded the least putative coding region, with 0.8% of 
genomic sequence called as coding region. 
20 The consensus data were as follows. GRAIL and 

GENEFINDER agreed on 0.7% of genomic sequence, GRAIL and 
DICTION agreed on 0.5% of genomic sequence, and the three 
programs together agreed on 0.25% of the data analyzed. 
That is, 0.25% of the genomic sequence was identified by 
25 all three of the programs as containing putative coding 
region. 

ORFs predicted by any two of the three programs 
("consensus ORFs") were assorted into "gene bins" using two 
criteria: (1) any 7 consecutive exons within a 25 kb window 
30 were placed together in a bin as likely contributing to a 
single gene, and (2) all ORFs within a 25 kb window were 
placed together in a bin as likely contributing to a single 
gene if fewer than 7 exons were found within the 25 kb 
window . 
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PGR - 

The largest ORF from each gene bin that did not 
span repetitive sequence was then chosen for amplification, 
as were all consensus ORFs longer than 500 bp. This method 
5 approximated one axon per gene/ however, a number of genes 
were found to be represented by multiple elements. 

Previously, we had determined that DNA fragments 
fewer than 250 bp in length do not bind well to the amino- 
modified glass surface of the slides used as support 
10 substrate for construction of microarrays; therefore, 
amplicons were designed in the present experiments to 
approximate 500 bp in length. 

Accordingly, after selecting the largest ORF per 
gene bin, a 500 bp fragment of sequence centered on the ORF 
15 was passed to the primer picking software, PRIMER3 
(available online for use at 

http://www-genome.wi.mit.edu/cgi-bin/primer/ ). A first 
additional sequence was commonly added to each ORF-unique 
5' primer, and a second, different, additional sequence was 

20 commonly added to each ORF-unique 3' primer, to permit 

subsequent reamplif ication of the amplicon using a single 
set of "universal" 5' and 3' primers, thus immortalizing 
the amplicon. The addition of universal priming sequences 
also facilitates sequence verification, and can be used to 

25 add a cloning site should some ORFs be found to warrant 
further study. 

The ORFs were then PGR amplified from genomic 
DNA, verified on agarose gels, and sequenced using the 
universal primers to validate the identity of the amplicon 

30 to be spotted in the microarray. 

Primers were supplied by Operon Technologies 
(Alameda, GA) . PGR amplification was performed by standard 
techniques using human genomic DNA (Clontech, Palo Alto, 
GA) as template. Each PGR product was verified by SYBR® 

35 green (Molecular Probes, Inc., Eugene, OR) staining of 

79 



wo 01/57274 PCT/USOl/00666 
agarose gels, with subsequent imaging by Fluorimager 
(Molecular Dynamics, Inc., Sunnyvale, CA) . PGR 
amplification was classified as successful if a single band 
appeared. 

5 The success rate for amplifying ORFs of interest 

directly from genomic DNA using PGR was approximately 75%. 
FIG. 5 graphs the distribution of predicted ORF (axon) 
length and distribution of amplified PGR products, with ORF 
length shown in red and PGR product length shown in blue 

10 (which may appear black in the figure) . Although the range 
of ORF sizes is readily seen to extend to beyond 900 bp, 
the mean predicted exon size was only 22 9 bp, with a median 
size of 150 bp (n=9498) . With an average amplicon size of 
475 + 25 bp, approximately 50% of the average PGR 

15 amplification product contained predicted coding region, 
with the remaining 50% of the amplicon containing either 
intron, intergenic sequence, or both. 

Using a strategy predicated on amplifying about 
500 bp, it was found that long exons had a higher PGR 

20 failure rate. To address this, the bioinf ormatics process 
was adjusted to amplify 1000, 1500 or 2000 bp fragments 
from exons larger than 500 bp. This improved the rate of 
successful amplification of exons exceeding 500 bp, 
constituting about 9.2% of the exons predicted by the gene 

25 finding algorithms. 

Approximately 75% of the probes disposed on the 
array (90% of those that successfully PGR amplified) were 
sequence-verified by sequencing in both the forward and 
reverse direction using MegaBAGE sequencer (Molecular 

30 Dynamics, Inc., Sunnyvale, GA) , universal primers, and 
standard protocols. 

Some genomic clones (BACs) yielded very poor PGR 
and sequencing results. The reasons for this are unclear, 
but may be related to the quality of early draft sequence 

35 or the inclusion of vector and host contamination in some 
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submitted sequence data. 

Although the intronic and intergenic material 
flanking coding regions could theoretically interfere with 
hybridization during microarray experiments, subsequent 
5 empirical results demonstrated that differential expression 
ratios were not significantly affected by the presence of 
noncoding sequence. The variation in axon size was 
similarly found not to affect differential expression 
ratios significantly; however, variation in axon size was 
10 observed to affect the absolute signal intensity (data not 
shown) . 

The 350 MB of genomic DNA was, by the above- 
described process, reduced to 9750 discrete probes, which 
were spotted in duplicate onto glass slides using 

15 commercially available instrumentation (MicroArray Genii 
Spotter and/or MicroArray Genlll Spotter, Molecular 
Dynamics, Inc., Sunnyvale, CA) . Each slide additionally 
included either 16 or 32 E. coll genes, the average 
hybridization signal of which was used as a measure of 

20 background biological noise. 

Each of the probe sequences was BLASTed against 
the human EST data set, the NR data set, and SwissProt 
GenBank (May 7, 1999 release 2.0.9). 

One third of the probe sequences (as amplified) 

25 produced an exact match (BLAST Expect ("E") values less 

than 1 e'^°°) to either an EST (20% of sequences) or a known 
mRNA (13% of sequences) . A further 22% of the probe 
sequences showed some homology to a known EST or mRNA 
(BLAST E values from 1 e"^ to 1 e"^^) . The remaining 45% of 

30 the probe sequences showed no significant sequence homology 
to any expressed, or potentially expressed, sequences 
present in public databases. 

All of the probe sequences (as amplified) were 
then analyzed for protein similarities with the SwissProt 

35 database using BLASTX, Gish et al., Nature Genet. 3:266 
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(1993) . The predicted functional breakdowns of the 2/3 of 
probes identical or homologous to known sequences are 
presented in Table 1. 



5 Table 1 



Function of Predicted ORFs As Deduced From Comparative 

Sequence Analysis 

Total V6 chip V7 chip Function Predicted from 
Comparative Sequence 

Analysis 


211 


96 


115 


Receptor 


120 


43 


77 


Zinc Finger 


30 


11 


19 


Homeobox 


25 


9 


16 


Transcription Factor 


17 


11 


7 


Transcription 


118 


57 


61 


Structural 


95 


3 9 


56 


Kinase 


36 


18 


18 


Phosphatase 


83 


31 


52 


Ribosomal 


45 


19 


26 


Transport 


21 


17 


14 


Growth Factor 


17 


12 


5 


Cytochrome 


50 


33 


17 


Channel 



As can be seen, the two most common types of 
genes were transcription factors and receptors, making up 
2.2% and 1.8% of the arrayed elements, respectively. 

10 



EXAMPLE 2 

Gene Expression Measurements From Genome-Derived Single 
Exon Microarrays 
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The two genome-derived single exon microarrays 
prepared according to Example 1 were hybridized in a series 
of simultaneous two-color fluorescence experiments to (1) 
5 Cy3-labeled cDNA synthesized from message drawn 

individually from each of brain, heart, liver, fetal liver, 
placenta, lung, bone marrow, HeLa, BT 474, or HBL 100 
cells, and (2) Cy5-labeled cDNA prepared from message 
pooled from all ten tissues and cell types, as a control in 
10 each of the measurements. Hybridization and scanning were 
carried out using standard protocols and Molecular Dynamics 
equipment . 

Briefly, mRNA samples were bought from commercial 
sources (Clontech, Palo Alto, CA and Amersham Pharmacia 

15 Biotech (APE) ) . Cy3-dCTP and Cy5-dCTP (both from APB) were 
incorporated during separate reverse transcriptions of -1 \iq 
of polyA"^ mRNA performed using 1 pg oligo (dT) 12-18 primer 
and 2 \ig random 9mer primers as follows. After heating to 
70°C, the RNA:primer mixture was snap cooled on ice. After 

20 snap cooling on ice, added to the RNA to the stated final 
concentration was: IX Superscript II buffer, 0.01 M DTT, 
lOOpM dATP, 100 pM dGTP, 100 pM dTTP, 50 pM dCTP, 50 pM 
Cy3-dCTP or Cy5-dCTP 50 pM, and 200 U Superscript II 
enzyme. The reaction was incubated for 2 hours at 42°C. 

25 After 2 hours, the first strand cDNA was isolated by adding 
1 U Ribonuclease H, and incubating for 30 minutes at 37°C. 
The reaction was then purified using a Qiagen PGR cleanup 
column, increasing the number of ethanol washes to 5. 
Probe was eluted using 10 mM Tris pH 8.5. 

30 Using a spectrophotometer, probes were measured 

for dye incorporation. Volumes of both CyS and Cy5 cDNA 
corresponding to 50 pmoles of each dye were then dried in a 
Speedvac, resuspended in 30 pi hybridization solution 
containing 50% formamide, 5X SSC, 0.2 pg/pl poly(dA), 0.2 

35 pg/pl human Cotl DNA, and 0.5 % SDS. 
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Hybridizations viere carried out under a 
coverslip, with the array placed in a humid oven at 42°C 
overnight. Before scanning, slides were washed in IX SSC, 
0.2% SDS at 55°C for 5 minutes, followed by 0 . IX SSC, 0.2% 
5 SDS, at 55°C for 20 minutes. Slides were briefly dipped in 
water and dried thoroughly under a gentle stream of 
nitrogen . 

Slides were scanned using a Molecular Dynamics 
Gen3 scanner, as described. Schena (ed.), Microarray 

10 Blochlp: Tools and Technology , Eaton Publishing 
Company /BioTechniques Books Division (2000) (ISBN: 
1881299376) . 

Although the use of pooled cDNA as a reference 
permitted the survey of a large number of tissues, it 

15 attenuates the measurement of relative gene expression, 

since every highly expressed gene in the tissue/cell type- 
specific fluorescence channel will be present to a level of 
at least 10% in the control channel. Because of this fact, 
both signal and expression ratios (the latter hereinafter, 

20 "expression" or "relative expression") for each probe were 
normalized using the average ratio or average signal, 
respectively, as measured across the whole slide. 

Data were accepted for further analysis only when 
signal was at least three times greater than biological 

25 noise, the latter defined by the average signal produced by 
the E. coll control genes. 

The relative expression signal for these probes 
was then plotted as function of tissue or cell type, and is 
presented in FIG. 6. 

30 FIG. 6 shows the distribution of expression 

across a panel of ten tissues. The graph shows, the number 
of sequence-verified products that were either not 
expressed ("0"), expressed in one or more but not all 
tested tissues ("1" - "9"), and expressed in all tissues 

35 tested ("10"). 
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Of 9999 arrayed elements on the two microarrays 
(including positive and negative controls and "failed" 
products), 2353 (51%) were expressed in at least one tissue 
or cell type. Of the gene elements showing significant 
5 signal — where expression was scored as "significant" if 
the normalized Cy3 signal was greater than 1, representing 
signal 5-fold over biological noise (0.2) - 39% (991) were 
expressed in all 10 tissues. The next most common class 
(15%) consisted of gene elements expressed in only a single 
10 tissue. 

The genes expressed in a single tissue were 
further analyzed, and the results of the analyses are 
compiled in FIG. 7. 

FIG. 7A is a matrix presenting the expression of 

15 all verified sequences that showed expression greater than 
3 in at least one tissue. Each clone is represented by a 
column in the matrix. Each of the 10 tissues assayed is 
represented by a separate row in the matrix, and relative 
expression of a clone in that tissue is indicated at the 

20 respective node by intensity of green shading, with the 
intensity legend shown in panel B. The top row of the 
matrix ("EST Hit") contains "bioinf ormatic" rather than 
"physical" expression data - that is, presents the results 
returned by query of EST, NR and SwissProt databases using 

25 the probe sequence. The legend for "bioinf ormatic 
expression" (i.e., degree of homology returned) is 
presented in panel C. Briefly, white is known, black is 
novel, with gray depicting nonidentical with significant 
homology (white: E values < le-100; gray: E values from le- 

30 05 to le-99; black: E values > le-05) . 

As FIG. 7 readily shows, heart and brain were 
demonstrated to have the greatest numbers of genes that 
were shown to be uniquely expressed in the respective 
tissue. In brain, 200 uniquely expressed genes were 

35 identified; in heart, 150. The remaining tissues gave the 
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following figures for uniquely expressed genes: liver, 100; 
lung, 70; fetal liver, 150; bone marrow, 75; placenta, 100; 
HeLa, 50; HBL, 100; and BT474, 50. 

It was further observed that there were many more 
5 "novel" genes among those that were up-regulated in only 
one tissue, as compared with those that were down-regulated 
in only one tissue. In fact, it was found that ORFs whose 
expression was measurable in only a single of the tested 
tissues were represented in sequencing databases at a rate 

10 of only 11%, whereas 36% of the ORFs whose expression was 
measurable in 9 of the tissues were present in public 
databases. As for those ORFs expressed in all ten tissues, 
fully 45% were present in existing expressed sequence 
databases. These results are not unexpected, since genes 

15 expressed in a greater number of tissues have a higher 

likelihood of being, and thus of having been, discovered by 
EST approaches. 



Comparison of Signal from Known and Unknown Genes 
20 The normalized signal of the genes found to have 

high homology to genes present in the GenBank human EST 
database were compared to the normalized signal of those 
genes not found in the GenBank human EST database. The 
data are shown in FIG. 8. 
25 FIG. 8 shows the normalized Cy3 signal intensity 

for all sequence-verified products with a BLAST Expect 
("E") value of greater than le-30 (designated "unknown") 
upon query of existing EST, NR and SwissProt databases, and 
shows in blue the normalized Cy3 signal intensity for all 
30 sequence-verified products with a BLAST Expect value of 

less than le-30 ("known") . Note that biological background 
noise has an averaged normalized Cy3 signal intensity of 
0.2. 

As expected, the most highly expressed of the 
35 ORFs were "known" genes. This is not surprising, since 
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very high signal intensity correlates with very commonly- 
expressed genes, which have a higher likelihood of being 
found by EST sequence. 

However, a significant point is that a large 
5 number of even the high expressers were "unknown" . Since 
the genomic approach used to identify genes and to confirm 
their expression does not bias exons toward either the 3' 
or 5' end of a gene, many of these high expression genes 
will not have been detected in an end-sequenced cDNA 
10 library. 

The significant point is that presence of the 
gene in an EST database is not a prerequisite for 
incorporation into a genome-derived microarray, and 
further, that arraying such "unknown" exons can help to 
15 assign function to as-yet undiscovered genes. 



Verification of Gene Expression 

To ascertain the validity of the approach 
described above to identify genes from raw genomic 

20 sequence, expression of two of the probes was assayed using 
reverse transcriptase polymerase chain reaction (RT PGR) 
and northern blot analysis. 

Two microarray probes were selected on the basis 
of exon size, prior sequencing success, and tissue-specific 

25 gene expression patterns as measured by the microarray 
experiments. The primers originally used to amplify the 
two respective ORFs from genomic DNA were used in RT PGR 
against a panel of tissue-specific cDNAs (Rapid-Scan gene 
expression panel 24 human cDNAs) (OriGene Technologies, 

30 Inc., Rockville, MD) . 

Sequence AL07 9300_1 was shown by microarray 
hybridization to be present in cardiac tissue, and sequence 
AL031734_1 was shown by microarray experiment to be present 
in placental tissue (data not shown) . RT-PGR on these two 

35 sequences confirmed the tissue-specific gene expression as 
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measured by microarrays, as ascertained by the presence of 
a correctly sized PGR product from the respective tissue 
type cDNAs. 

Clearly, all microarray results cannot, and 
5 indeed should not, be confirmed by independent assay 

methods, or the high throughput, highly parallel advantages 
of microarray hybridization assays will be lost. However, 
in addition to the two RT-PCR results presented above, the 
observation that 1/3 of the arrayed genes exist in 

10 expression databases provides powerful confirmation of the 
power of our methodology - which combines bioinf ormatic 
prediction with expression confirmation using genome- 
derived single exon microarrays - to identify novel genes 
from raw genomic data. 

15 To verify that the approach further provides 

correct characterization of the expression patterns of the 
identified genes, a detailed analysis was performed of the 
microarrayed sequences that showed high signal in brain. 

For this latter analysis, sequences that showed 

20 high (normalized) signal in brain, but which showed very 
low (normalized) signal (less than 0.5, determined to be 
biological noise) in all other tissues, were further 
studied. There were 82 sequences that fit these criteria, 
approximately 2% of the arrayed elements. The 10 sequences 

25 showing the highest signal in brain in microarray 

hybridizations are detailed in Table 2, along with assigned 
function, if known or reasonably predicted. 

Table 2 

Function of the Most Highly 
Expressed Genes Expressed Only in Brain 
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Microarray Normal Express! Homology Gene Function 
Sequence ized on Ratio to EST as described by 
Name Signal present GenBank 
in 

GenBank 


AP000217-1 


5.2 


+7.7 


High 


S-100 protein, 
b-chain, Ca^"^ 
binding protein 
expressed in 
central nervous 
system 


AP000047-1 


2.3 




High 


Unknown 
Function 


AC006548-9 


1.7 




High 


Similar to 
mouse membrane 
glyco-protein 
M6, expressed 
in central 
nervous system 


AC007245-5 


1.5 




High 


Similar to 
amphiphysin, a- 
synaptic 
vesicle- 
associated 
protein. Ref 21 


L44140-4 


1.2 


+2.0 


High 


Endothelial 
act in-binding 
protein found 
in nonmuscle 
f ilamin 



89 



wo 01/57274 



PCT/USOl/00666 



AC004689-9 


1.2 


+3.5 


High 


Protein 
Phosphatase 
PP2A, neuronal/ 
downregulates 
activated 
protein kinases 


AL031657-1 


1.2 


+3.0 


High 


Unknown 
function/ 
Contains the 
anhyrin motif, 
a common 
protein 
sequence motif 


AC009266-2 


1.1 


+3.7 


Low 


Low homology to 

the 

Synaptotagmin I 
protein in 
rat/present at 
low levels 
throughout rat 
brain 


AP000086-1 


1.0 


+2.7 


Low 


Unknown, very 
poor homology 
to collagen 


AC004589-3 


1.0 




•High 


Protein 
Phosphatase 
PP2A, neuronal/ 
downregulates 
activated 
protein kinases 



Of the ten sequences studied by these latter 
confirmatory approaches, eight were previously known. Of 
these eight, six had previously been reported to be 
important in the central nervous system or brain. The exon 
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giving the highest signal (AP00217-1) was found to be the 
gene encoding an SIOOB Ca^"^ binding protein, reported in 
the literature to be highly and uniquely expressed in the 
central nervous system. Heizmann, Neurochem. Res. 9:1097 
5 (1997). 

A number of the brain-specific probe sequences 
(including AC006548-9, AC009266-2) did not have homology to 
any known human cDNAs in GenBank but did show homology to 
rat and mouse cDNAs . Sequences AC004689-9 and AC004689-3 

10 were both found to be phosphatases present in neurons 
(Millward et al., Trends Biochem. Sci. 24 (5) : 186-191 
(1999)). Two microarray sequences, AP000047-1 and 
AP000086-1 have unknown function, with AP000086-1 being 
absent from GenBank. Functionality can now be narrowed 

15 down to a role in the central nervous system for both of 
these genes, showing the power of designing microarrays in 
this fashion. 

Next, the function of the chip sequences with the 
highest (normalized) signal intensity in brain, regardless 

20 of expression in other tissues, was assessed. In this 
latter analysis, we found expression of many more common 
genes, since the sequences were not limited to those 
expressed only in brain. For example, looking at the 20 
highest signal intensity spots in brain, 4 were similar to 

25 tubulin (AC00807905; AF146191-2; AC007664-4; AF14191-2), 2 
were similar to actin (AL035701-2; AL034402-1) , and 6 were 
found to be homologous to glyceraldehyde-3-phosphate 
dehydrogenase (GAPDH) (AL035604-1; Z86090-1; AC006064-L, 
AC006064-K; AC035604-3; AC006064-L) . These genes are often 

30 used as controls or housekeeping genes in microarray 
experiments of all types. 

Other interesting genes highly expressed in brain 
were a ferritin heavy chain protein, which is reported in 
the literature to be found in brain and liver (Joshi et 

35 ai., J. Neurol. Sci. 134 (Suppl) : 52-56 (1995)), a result 

91 



wo 01/57274 PCT/USOl/00666 

duplicated with the array. Other highly expressed chip 
sequences included a translation elongation factor ID 

(AC007564-4) , a DEAD-box homolog (AL023804-4 ) , and a Y- 
chromosome RNA-binding motif (Chai et ai., Genomics 
5 49(2):283-89 (1998 ) ) (AC007320-3 ) . A low homology analog 

(AP00123-1/2) to a gene, DSCRl, thought to be involved in 
trisomy 21 (Down's syndrome), showed high expression in 
both brain and heart, in agreement with the literature 

(Fuentes et al., Mol. Genet. 4(10):1935-44 (1995)). 
10 As a further validation of the approach, we 

selected the BAG AC006064 to be included on the array. 
This BAG was known to contain the GAPDH gene, and thus 
could be used as a control for the ORF selection process. 
The gene finding and exon selection algorithms resulted in 
15 choosing 25 exons from BAG AC006064 for spotting onto the 
array, of which four were drawn from the GAPDH gene. Table 
3 shows the comparison of the average expression ratio for 
the 4 exons from BAC006064 compared with the average 
expression ratio for 5 different dilutions of a 
20 commercially available GAPDH cDNA (Clontech) . 



Table 3 



Comparison of Expression Ratio, for each 
tissue, of GAPDH 




AC'')0 60 64 (n = 4) 


Control ( n = 5) 


Bone Marrow 


-1.81 + 0.11 


-1.85 ± 0.08 


Brain 


-1.41 ± 0.11 


-1.17 ± 0.05 


BT474 


1.85 ± 0.09 


1.66 + 0.12 


Fetal Liver 


-1.62 ± 0.07 


-1.41 ± 0.05 


HBLIOO 


1.32 ± 0.05 


2.64 ± 0.12 


Heart 


1.16 ± 0.09 


1.56 ± 0.10 


HeLa 


1.11 ±0.06 


1.30 ± 0.15 


Liver 


-1.62 ± 0.22 


-2.07 + 
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Lung 


-4.95 ± 0.93 


-3.75 + 0.21 


Placenta 


-3.56 ± 0.25 


-3.52 + 0.43 



Each tissue shows excellent agreement between the 
experimentally chosen axons and the control, again 
5 demonstrating the validity of the present exon mining 

approach. In addition, the data also show the variability 
of expression of GAPDH within tissues, calling into 
question its classification as a housekeeping gene and 
utility as a housekeeping control in microarray 
10 experiments. 

EXAMPLE 3 

Representation of Sequence and Expression Data as a 
"Mondrian" 

15 

For each genomic clone processed for microarray 
as above-described, a plethora of information was 
accumulated, including full clone sequence, probe sequence 
within the clone, results of each of the three gene finding 

20 programs, EST information associated with the probe 
sequences, and microarray signal and expression for 
multiple tissues, challenging our ability to display the 
information. 

Accordingly, we devised a new tool for visual 

25 display of the sequence with its attendant annotation 
which, in deference to its visual similarity to the 
paintings of Piet Mondrian, is hereinafter termed a 
"Mondrian". FIGS. 3 and 4 present the key to the 
information presented on a Mondrian. 

30 FIG. 9 presents a Mondrian of BAG AC008172 (bases 

25,000 to 130,000 shown), containing the carbamyl phosphate 
synthetase gene (AF154830 . 1) . Purple background within the 
region shown as field 81 in FIG. 3 indicates all 37 known 
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exons for this gene. 

As can be seen, GRAIL II successfully identified 
27 of the known exons (73%), GENEFINDER successfully 
identified 37 of the known exons (100%), while DICTION 
5 identified 7 of the known exons (19%). 

Seven of the predicted exons were selected for 
physical assay, of which 5 successfully amplified by PGR 
and were sequenced. These five exons were all found to be 
from the same gene, the carbamyl phosphate synthetase gene 

10 (AF154830.1). 

The five exons were arrayed, and gene expression 
measured across 10 tissues. As is readily seen in the 
Mondrian, the five chip sequences on the array show 
identical expression patterns, elegantly demonstrating the 

15 reproducibility of the system. 

FIG. 10 is a Mondrian of BAG AL049839. We 
selected 12 exons from this BAG, of which 10 successfully 
sequenced, which were found to form between 5 and 6 genes. 
Interestingly, 4 of the genes on this BAG are protease 

20 inhibitors. Again, these data elegantly show that exons 
selected from the same gene show the same expression 
patterns, depicted below the red line. From this figure, 
it is clear that our ability to find known genes is very 
good. A novel gene is also found from 8 5.6 kb to 88.6 kb, 

25 upon which all the exon finding programs agree. We are 
confident we have two exons from a single gene since they 
show the same expression patterns and the exons are 
proximal to each other. Backgrounds in the following 
colors indicate a known gene (top to bottom) : 

30 red = kallistatin protease inhibitor (P29622); 

purple = plasma serine protease inhibitor (P05154) ; 
turquoise = al anti-chymotrypsin (POlOll) ; mauve = 40S 
ribosomal protein (P08865) . Note that chip sequence 8 and 
12 did not sequence verify. 
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EXAMPLE 4 

Genome-Derived Single Exon Probes Useful For Measuring 
Human Gene Expression 

5 

The protocols set forth in Examples 1 and 2, 
supra, were applied to additional human genomic sequence as 
it became newly available in GenBank to identify unique 
exons in the human genome that could be shown to be 

10 expressed at significant levels in heart tissue. 

These unique exons are within longer probe 
sequences. Each probe was completely sequenced on both 
strands prior to its use on a genome-derived single exon 
microarray; sequencing confirms the exact chemical 

15 structure of each probe. An added benefit of sequencing is 
that it placed us in possession of a set of single base- 
incremented fragments of the sequenced nucleic acid, 
starting from the sequencing primer 3' OH. (S ince the 
single exon probes were first obtained by PGR amplification 

20 from genomic DNA, we were of course additionally in 

possession of an even larger set of single base incremented 
fragments of each of the 9,980 single exon probes, each 
fragment corresponding to an extension product from one of 
the two amplification primers.) 

25 The structures of the 9,980 unique single exon 

probes are clearly presented in the Sequence Listing as SEQ 
ID Nos.: 1 - 9,980. The 16 nt 5' primer sequence and 16 nt 
3 ' primer sequence present on the amplicon are not included 
in the sequence listing. The sequences of the exons 

30 present within each of these probes is presented in the 
Sequence Listing as SEQ ID Nos.: 9,981 - 19,771, 
respectively. It will be noted that some amplicons have 
more than one exon, some exons are contained in more than 
one amplicon. 

35 As detailed in Example 2, expression was 
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demonstrated by disposing the amplicons as single exon 
probes on nucleic acid microarrays and then performing two- 
color fluorescent hybridization analysis; significant 
expression is based on a statistical confidence that the 
5 signal is significantly greater than negative biological 
control spots. The negative biological control is formed 
from spotted DNA sequences from a different species. Here, 
32 sequences from E.Coli were spotted in duplicate to give 
a total of 64 spots. 
10 For each hybridisation (each slide, each colour) 

the median value of the signal from all of the spots is 
determined. The normalised signal value is the arithmetic 
mean of the signal from duplicate spots divided by the 
population median. 
15 Control spots are eliminated if there is more 

that a five-fold difference between each one of the 
duplicate spots raw signals. 

The median of the signal from the remaining 
control spots is calculated and all subsequent calculations 
20 are done with normalised signals. 

Control spots having a signal of greater than 
median +2.4 (the value 2.4 is roughly 12 times the 
observed standard deviation of control spot populations) 
are eliminated. Spots with such high signals are considered 
25 to be "outliers". 

The mean and standard deviation of the modified 
control spot populations are calculated. 

The mean + 3x the standard deviation (mean + 
(3*SD) ) is used as the signal threshold qualifier for that 
30 particular hybridisation. Thus, individual thresholds are 
determined for each channel and each hybridisation. 

This means that, assuming that the data is 
distributed normally, there is a 99% confidence that any 
signal exceeding the threshold is significant. 
35 The probes and their expression data are 
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presented in Table 4, set forth respectively in Example 5. 
Example 5 presents the subset of probes that is 
significantly expressed in the human heart and thus 
presents the subset of probes that was recognized to be 
5 useful for measuring expression of their cognate genes in 
human heart tissue. 

The sequence of each of the axon probes 
identified by SEQ ID NOS . : 9,981 - 19,771 was individually 
used as a BLAST (or, for SWISSPROT, BLASTX) query to 
10 identify the most similar sequence in each of dbEST, 

SwissProt (BLASTX) , and NR divisions of GenBank. Because 
the query sequences are themselves derived from genomic 
sequence in GenBank, only nongenoraic hits from NR were 
scored. 

15 The smallest in value of the BLAST (or BLASTX) 

expect ("E") scores for each query sequence across the 
three database divisions was used as a measure of the 
"expression novelty" of the probe's ORF. Table 4 is sorted 
in descending order based on this measure, reported as 

20 "Most Similar (top) Hit BLAST E Value". Those sequences for 
which no "Hit E Value" is listed are those axons which were 
found to have no similar sequences. 

As sorted, Table 4 thus lists its respective 
probes (by "AMPLICON SEQ ID NO.:" and additionally by the 

25 SEQ ID NO:, of the axon contained within the probe :"EXON 
SEQ ID NO.:") from least similar to sequences- known to be 
expressed (i.e., highest BLAST E value), at the beginning 
of the table, to most similar to sequences known to be 
expressed (i.e., lowest BLAST E value), at the bottom of 

30 the table. 

Table 4 further provides, for each listed probe, 
the accession number of the database sequence that yielded 
the "Most Similar (top) Hit BLAST E Value", along with the 
name of the database in which the database sequence is ■ 
35 found ("Top Hit Database Source") . 
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Table 4 further provides SEQ ID NOS. 
corresponding to the predicted amino acid sequences where 
they have been determined for the probe and exon nucleotide 
sequences. These are set out as PEPTIDE SEQ ID NOS.:. The 
5 peptide sequences for a given exon are predicted as 

follows: Since each chip exon is a consensus sequence drawn 
from predictions from various exon finding programs (i.e. 
Grail, GeneFinder and GenScan) , the multiple initial ORFs 
are first determined in a uniform way according to each 

10 prediction. In particular, the reading frame for predicting 
the first amino acid in the peptide sequence always starts 
with the first base of any codon and ends with the last 
base of non-termination codon. Next, for each strand of the 
exon, initial ORFs are merged into one or more final ORFs 

15 in an exhaustive process based on the following criteria: 
1) the merging ORFs must be overlapping, and 2) the merging 
ORFs must be in the same frame. 

The Sequence Listing, which is a superset of all 
of the data presented in Table 4, further includes, for 

20 each probe, the most similar hit, with accession number and 
BLAST E value, from the each of the three queried 
databases . 

Table 4 further lists, for each probe, a portion 
of the descriptor for the top hit ("Top Hit Descriptor") as 

25 provided in the sequence database. For those ORFs that are 
similar in sequence, but nonidentical to known sequences 
(e.g., those with BLAST E values between about le-05 and 
le-100), the descriptor reveals the likely function of the 
protein encoded by the probe's ORF. 

30 Using BLAST E value cutoffs of le-05 (i.e., 1 x 

10"^) and le-100 (i.e., 1 x 10"-^°°) as evidence of similarity 
to sequences known to be expressed is of course arbitrary: 
in Example 2, supra, a BLAST E value of le-30 was used as 
the boundary when only two classes were to be defined for 

35 analysis (unknown, >le-30; known <le-30) (see also FIG. 8) . 
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Furthermore, even when the "Most Similar (Top) Hit BLAST E 
Value" is low, e.g., less than about le-100 — which is 
probative evidence that the query sequence has previously 
been shown to be expressed — the top hit is highly unlikely 
5 exactly to match the probe sequence. 

First, such expression entries typically will not 
have the intronic and/or intergenic sequence present within 
the single exon probes listed in the Table. Second, even 
the ORF itself is unlikely in such cases to be present 
10 identically in the databases, since most of the EST and 
mRNA clones in existing databases include multiple exons, 
without any indication of the location of exon boundaries. 

As noted, the data presented in Table 4 represent 
a proper subset of the data present within the attached 
15 sequence listing. For each amplicon probe (SEQ ID NOs . : 1 
- 9,980) and probe exon (SEQ ID NOs.: 9,981 - 19,771, 
respectively), the sequence listing further provides, 
through iterated annotation fields <220> and <223>: 

, . (a) the accession number of the BAC from which 
20 the sequence was derived ("MAP TO"), thus providing a link 
to the Qhromosomal map location and other information about 
the genomic milieu of the probe sequence; 

(b) the most similar sequence provided by BLAST 
query of the EST database, with accession number' and BLAST 

25 E value for the "hit"; 

(c) the most similar sequence provided by BLAST 
query of the GenBank NR database, with accession number and 
BLAST E value for the "hit"; and 

(d) the most similar sequence provided by BLASTX 
30 query of the SWISSPROT database, with accession number and 

BLAST E value for the "hit". 



EXAMPLE 5 

35 Genome-Derived Single Exon Probes Useful For Measuring 
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Expression of Genes in Human Heart 

Table 4 (413 pages) presents expression, homology, and 
functional information for the genome-derived single exon 
5 probes that are expressed significantly in human heart. 
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Homo sapiens LSS gene, partial, exons 15, 16, 17 and 18 | 




factor IIH polypeptide 2 (Gtf2li2) genes, complete cds | 


factor IIH polypeptide 2 (Gtf2h2) genes, complete cds | 






Mus musculus AT3 gene for antitlirombin, complete cds | 


IMMEDIATE-EARLY PROTEIN 1 (IE1 ) (IMMEDIATE-EARLY PHOSPHOPROTEIN PP89) | 


Leuciscus cephalus orientalis cytochrome b (cyt b) gene, partial cds; mitochondrial gene for mitochondrial 1 
Iproduct 1 




601651038R1 NIH_MGC_81 Homo sapiens cDNA clone IMAGE:39345G2 3' | 
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283551 




i 










252301 


25701 1 
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1 


i 


5678 
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1 
\ 


Cynops pyrrhogaster CpTbxS premature mRNA, partial cds | 


Cynops pyrrhogaster CpTbxS premature mRNA, partial cds | 


Homo sapiens insulin receptor substrate 1 (IRS1) mRNA 1 




z 

5 
< 
< 




THROMBOSPONDIN 1 PRECURSOR | 


THROMBOSPONDIN 1 PRECURSOR | 


602128876F1 NIH_MGC_56 Homo sapiens cDNA clone IMAGE:4285506 5' | 


i 
1 

8 

i 
1 

5 

i 


HISTIDINE-RICH GLYCOPROTEIN PRECURSOR | 






ZINC-FINGER PROTEIN 1 (ZINC-FINGER HOMEODOMAIN PROTEIN 1) | 


ZINC-FINGER PROTEIN 1 (ZINC-FINGER HOMEODOMAIN PROTEIN 1) | 


,Arabidopsis thaliana DNA chromosome 4, contig fragment No. 91 | 


IHYPOTHETICAL 1 7.3 KDA PROTEIN IN MRDA-PHPB INTERGENIC REGION | 


lARGININE KINASE (AK) | 


WD-40 REPEAT PROTEIN MSI3 | 




f2 

i 

5 

p 
i 
1 

Q 


|za07o1 1 ,r1 Scares melanocyte 2NbHM Homo sapiens cDNA clone IMAGE:291860 5' | 


[zaG7c1 1 n Scares melanocyte 2NbHM Homo sapiens cDNA clone IMAGE:291 850 5' | 


lOUTER CAPSID PROTEIN VP4 (HEMAGGLUTININ) (OUTER LAYER PROTEIN VP4) [CONTAINS: 1 
loUTER CAPSID PROTEINS VPS AND VP8] | 


IHYPOTHETICAL 157.0 KDA PROTEIN 038010.5 IN CHROMOSOME III | 


[URIDYLATE KINASE (UK) (URIDINE MONOPHOSPHATE KINASE) (UMP KINASE) | 


[URIDYLATE KINASE (UK) (URIDINE MONOPHOSPHATE KINASE) (UMP KINASE) | 


[PROBABLE CATION-TRANSPORTING ATPASE C6G3.05C | 


ENV POLYPROTEIN [CONTAINS: COAT PROTEIN GP52; COAT PROTEIN GP36] j 


complete cds 


[ Homo sapiens DESC1 protein (DESC1 ), mRNA | 


Top Hit 


II 


NT 


[NT 1 


[NT 1 


z 


ISWISSPROT 1 


iNT 1 


ISWISSPROT 1 


ISWISSPROT 1 


1 


|SV/ISSPROT 1 


jSWiSSPROT 1 


iNT 1 


NT 


[SWISSPROT 1 


jSWiSSPROT 1 


NT _ 1 


1 SV/ISSPROT 1 


ISWISSPROT 1 


1 SV/ISSPROT 1 


ISV/ISSPROT 1 


[SV/ISSPROT 1 


lEST HUMAN 1 


[EST HUMAN | 


IsWISSPROT 1 


1 


ISWISSPROT 1 


1 
w 


SWISSPROT 


ISV/ISSPROT 1 


NT 


|NT 1 


TooHitAcession 




i 

i 

< 


i 

i 


50318041 


'AJ131719.1 1 


1 


'AL445065.1 | 


IP35441 1 


IP35441 - 1 


8 
1 


i 




IL12051.1 I 


IL12051.1 




IP2B166 1 


lAL1 61 595.2 




IP48610 




1 


[P44834 1 


1 


[W03412.1 


P36307 


8 
a 


IQ9ZE07 


IQ9ZE07 


Q10309 


P03374 


AF155142.1 


S 

1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


8.7E+00 


8.7E+00I 


1 


1 .8.1E+00| 


1 8.0E+00| 




'.5E+00 


'.5E+00| 


'.4E+00| 


f 


f 


'.2E+00| 


'.2E+00 


ME+00| 


ME+00 


ME+00| 


1 


'.OE+00 1 


'.OE+OO] 


1 6.9E+00| 


1 6.9E+00[ 


6.8E+00 


1 6.8E+00[ 


1 6.8E+00 


6.8E+00 




6.6E+O0I 


6.6E+00I 


6.5E+00I 


5.9E+OO; 


f 






























1 


Signal 


1.71 


1.71 






2.31 


t 2.071 


1.53 


1.53 


2.581 


3.83| 




































2.17, 


8.321 


7.16 




ORFSEol 


ID NO: 


25871 


25872 


R 












i 


271551 


271561 




22672 


26254 


262551 




i 












i 






1 


ft 




1 27372 






Exon 
SEQ ID 
NO: 


15754 


15754 


S 
S 


16452 


i 


i 


16747 


16747 


i 


1 


1 






16105 


16105 


17386 


18577 


17585 


8443 


i 


17775 


1 


i 


17154 




1 


1 


18344 


17172 


? i 


s 
































































Probe 


SEQ ID 
NO: 


5848 






1 


i 


s? 

S3 






5544 


7085 


7085 


2947 


2947 




1 


1 75351 




7735 


8575 


i 


7925 




i 


1 7277 


1 


7796 


1 


8471 




I 1 


1 
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o O I 

a. a. • 

iii 

2 CJ o 

III 



o z K 

O O " 



c3od 
lis 



2S5 



PI 
lit 



ii 



111' 



II 
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Top Hit Descriptor 


< 
E 
f 

8 

a: 

1 
1 
1 




ICYR61 PROTEIN PRECURSOR (3CH61) | 


E 

1 


RETINAL GUANYLYL CYCLASE 2 PRECURSOR (GUANYLATE CYCLASE 2F, RETINAL) (RETGC-2) 
|(ROD OUTER SEGMENT MEMBRANE GUANYLATE CYCLASE 2) (R0S-GC2) (GUANYLATE CYCLASE 
|f) (GC-F) 


RETINAL GUANYLYL CYCLASE 2 PRECURSOR (GUANYLATE CYCLASE 2F, RETINAL) (RETGC-2) 
l(ROD OUTER SEGMENT MEMBRANE GUANYLATE CYCLASE 2) (R0S-GC2) (GUANYLATE CYCLASE 

jF) (GC-F) 




F.pringlei gdcsPA gene for P-protein of the glycine cleavage system | 


iBRAIN-SPECIFIC ANGIOGENESIS INHIBITOR 1 PRECURSOR | 


iBRAIN-SPEClFlC ANGIOGENESIS INHIBITOR 1 PRECURSOR | 


[ADHERENCE FACTOR (ADHESION AND AGGREGATION MEDIATING SURFACE ANTIGEN) ( 


Buxus harlandil maturase K (matK) gene, partial cds; chloroplast gene for chloroplast product | 




1 
E 

i 
1 

i 




1 
E 

f 
1 

f 
1 
1 

J 

i 


Homo sapiens apoA polymorphism Kringle IV gene, exons 1 and 2 | 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen deprivation 1 


|CM0-BT0281-031199-O87-h04 BT0281 Homosapiens cDNA | 




Mus musculus SRY-box containing gene 13 (Sox13), mRNA | 


|MusmusculusSRY-boxcontaininggene13(Sox13), mRNA | 




1 faba bean necrotic yellows virus C2-Eg gene, isolate Egyptian EV1 -93 | 


ifaba bean necrotic yellows virus C2-Eg gene, isolate Egyptian EVl-93 | 






I Homo sapiens ATP-binding cassette, sub-family B (MDRH-AP), member 4 (ABCB4), mRNA 


1 Aspergillus nidulans recQ gene for DNA helicase, exons 1-4 | 


Top Hit 


NT J 


NT J 


ISWISSPROT 1 


NT 


SWISSPROT 


SWISSPROT 


iNT 1 


NT J 


[SWISSPROT 1 


ISWISSPROT 1 


ISWISSPROT 1 


iNT 1 


NT 


NT 




iNT 1 


NT _ 


iNT 1 


lEST HUMAN J 


NT J 


NT 1 


^ J 




NT 1 


NT _ 


iNT 1 


iNT 1 


NT 


|NT 1 


.i 






















































1 90551931 






Top Hit Aces! 




i 


|P18406 


5 
1 


P51842 


P51842 


iAE002225.2 


IZ36S79.1 


I014S14 


I014514 


§ 
1 


1 
< 


i 

< 


s 




1 


IL14005.1 


|AL1 16459.1 


i 


|AF068749.1 






i 

< 


IAJ132180.1 


AJ1 321 80.1 


lALI 61 540.2 




AJ271 844.1 


MostSimiiar 
(Top) Hit 
BLAST E 
Value 


3.0E+00| 


1 3.0E+00| 


1 


1 3.0E+00I 


3.0E+00 


3.0E+00 


2.9E+q0j 


2.9E+00I 


_ 2,9E+00| 


1 


2.9E+00I 


2.8E+0C 


2.8E+00 


2.8E+00I 


1 




t 2.7E+00I 


2.7E+00] 


1 2.7E+00] 


2.6E+00] 


1 2.6E+00] 


1 2.6E+00] 


2.6E+Oo| 


1 2.6E+00] 


2.6E+00 


1 


2.6E+00] 


1 2.6E+00] 


1 2.5E+00] 


Expression 
Signai 




1 1-63| 








6.51 




1.63| 






5.19| 


_ 5.79 










1.75 


2.21 ] 




I 4.35 






5.42I 














ORF SEQ 
ID NO: 


22535 


1 249701 






28501 


28502 


1 217471 


261121 


§ 


26301 


26433 


21212 






1 20012] 


5 


i 25377] 






! 24253 










i 


1 27534] 








Exon 
SEQ ID 
NO; 


12737 


1 151951 




1 17041] 


18251 


i 


1 




1 161461 


1 


1 162711 




1 115191 


1 




i 




] 17062] 


1 




] 15324] 


] 15324] 


i 


16576 


i 


] 17418] 


] 17777] 


i 


1 11353] 


Probe 
SEQ ID 
NO; 


2807 


f2 
S 




1 71641 


8374 




1 


1 61291 


i 


i 


i 


1441 










1 


] 71851 


i 




5405 


1 54051 




i 


i 


1 75671 


7927I 


9711 
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TopHitDescriptDr 


Aspergillus nidulans recQ gene for DNA helicase, exons 1-4 " | 


z 
o 

1 

< 
o 

m 


TEICHOIC ACID BIOSYNTHESIS PROTEIN F 


TEICHOIC ACID BIOSYNTHESIS PROTEIN F 


ITEICHOIC ACID BIOSYNTHESIS PROTEIN F | 


IQV4-FT0005-1 10500-205-g07 FT0005 Homo sapiens cDNA 1 


1 Rice DNA for aldolase C-1 , complete cds 1 


1 Mus musculus EIF4H gene, part' al cds; L1MK1 gene, complete cds; and ELN gene, partial cds | 


Chicken alpha-3 collagen type VI mRNA, 3' end | 


1 Homo sapiens double C2-like domains, alpha (D0C2A) mRNA | 


IVITELLOGENIN 1 PRECURSOR (YOLK PROTEIN 1) | 


|CD27L RECEPTOR PRECURSOR (T-CELL ACTIVATION ANTIGEN CD27) (T14) | 


|CD27L RECEPTOR PRECURSOR (T-CELL ACTIVATION ANTIGEN CD27)(T14) | 


Helicobacter pylori, strain J99 section 47 of 132 of the complete genome | 


lRC2-PT0004-0312gg-011-d05 PT0004 Homo sapiens cDNA | 


|ENDOCHITINASEB PRECURSOR (CHN-B) 1 


SKIN GRANULE PROTEIN PRECURSOR 


jSKIN GRANULE PROTEIN PRECURSOR | 


H:sapiens CTGF gene and promoter region | 


[XYLULOSE KINASE ()C<LULOKINASE) | 


|hr63f05.x1 NCI_CGAP Kidi 1 Homo sapiens cDNA clone IMAGE:3133187 3' 1 


|hr63f06.x1 NCl_CGAP_Kid1 1 Homo sapiens cDNA clone IMAGE:3133187 3' | 






[ 

[ 

3 
1 
1)- 

I 

I 

< 

i 
\ 

I 
i 
\ 
I 
I 
\ 


1 G.domesticus artificial single chain antibody gene (L3) | 


IBos taurus partial cytb gene for cytochrome b 1 




1 MAJOR CENTROMERE AUTOANTIGEN B (CENTROMERE PROTEIN B) (CENP-B) | 




|ALPHA-(1 ,3)-FUCOSYLTRANSFERASE (GALACTOSIDE S-L-FUCOSYLTRANSFERASE) 1 
(FUCOSYLTRANSFERASE 4) (FUCT-IV) 


I502069121F1 NIH MGC 58 Homo sapiens cDNA clone IMAGE:40681 73 5' | 


I502059121F1 NIH_MGC_58 Homo sapiens cDNA clone IMAGE:40681 73 5' | 


Top Hit 
Database 
Source 




ISWISSPROT 1 


ISWISSPROT 1 


ISWISSPROT 1 


ISWISSPROT 1 


[EST HUMAN | 










ISWISSPROT 1 


ISWISSPROT 1 


1 

% 




lEST HUMAN | 


ISWISSPROT 1 


ISWISSPROT 1 


ISWISSPROT 1 


z 


ISWISSPROT 1 


|EST HUMAN 1 


|EST HUMAN | 


z 


z 


z 


z 


z 


ISWISSPROT 1 


z 


SWISSPROT 


'EST HUMAN | 


lEST HUMAN | 


Top HitAcession 
No. 


J271 844.1 1 


13485 1 


1 


13485 1 


13485 1 


|AW949158.1 1 


|D50307.1 1 


1 
< 


|M24282.1 1 




IP02843 1 


^26842 1 




§ 
< 


IAW875125.1 1 








1X92511.1 1 




IBE326702.1 1 


IBE32570Z1 | 


IY14079.1 1 


AF1 58652-2 


IZ46724.1 1 


IAJ4O1081.1 1 




IP07199 1 


1X50255.1 1 


Q11127 


IBF541987.1 | 


IBF54198/.1 1 












Most Similar 
(Top) Hit 
BLAST E 
Value 


t 2.5E+00| 




I 2.5E+00I 


2.5E+00| 


I 2.5E+00I 


t 2.5E+00I 


f 

^ 


2.5E+00I 


1 2.4E+00I 




2.4E+00I 


2.4E+00| 


1 
% 


2.4E+00I 




2.4E+00 


1 2.4E+00I 


2.4E+00I 




2.4E+00I 


i 


2.4E+00| 


2.4E+00I 


i 


2.3E+00I 


2.3E+00 


2.3E+00| 


2.3E+00I 


2.3E+00I 


2.3E+00 


2.3E+00I 


2.3E+00I 


Expression 
Signal 


















i 


S 








8 




" 




?3 




? 








s 


i 








« 








ORF SEQ 
ID NO: 


1 212171 






1 25537 1 


1 25538 1 




1 




1 22707] 


1 24485| 






1 






27194 




1 278541 






1 280021 
















1 


27364 






Exon 
SEQ ID 
NO: 




15468 






15458 
















i 
















17764 




18302 




1 


i 






1 






i 


































































Probe 
SEQ ID 
NO: 


1 1448 




1 


1 




1 


7264 


1 


1 


i 


1 


I 6733 1 




1 




7125 






7820 


1 


1 79141 


7914 




i 




1 


6408 


1 


1 65731 




i 


3 
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Top Hit Descriptor 


S01433673F1 NIH MGC 72 Homo sapiens cDNA clone IMAGE:3918643 5' | 


Neurospora orassa G protein alpha subunit GNA-3 (gna-3) gene, complete cds | 


Rat gene for regucalcin, exoni (non-coding exon) | 


Rat gens for regucalcin, sxoni (non-coding exon) | 


SORTILIN-RELATED RECEPTOR PRECURSOR (SORTING PROTEIN-RELATED RECEPTOR 
CONTAINING LDLR CLASS A REPEATS) (MSORLA) (SORLA-1) (LOW-DENSITY LIPOPROTEIN 
RECEPTOR RELATIVEWITH 11 LIGAND-BINDING REPEATS) (LDLR RELATIVE WITH 11 LIGAND- 
BINDING REPEATS) (LRU) (> 


SORTILIN-RELATED RECEPTOR PRECURSOR (SORTING PROTEIN-RELATED RECEPTOR 
CONTAINING LDLR CLASS A REPEATS) (MSORLA) (SORLA-1) (LOW-DENSITY LIPOPROTEIN 
.RECEPTOR RELATIVE WITH 11 LIGAND-BINDING REPEATS) (LDLR RELATIVE WITH 11 LIGAND- 
BINDING REPEATS) (LRU) (> 


1 S00943401 T1 NIH_MGC_1 7 Homo sapiens cDNA clone IMAGE:2959777 3' | 


MINOR VIRION STRUCTURAL PROTEIN MU-2 


nl95b02.s1 NCl_CGAP_Co10 Homo sapiens cDNA clone IIV1AGE:1058379 3' | 


G 

i 
1 

Q 

i 

1, 
1 

1 


|601594733F1 NIH MGC 9 Homo sapiens cDNA clone IMAGE:3948561 5' | 


ITRANSPOSON TY1 PROTEIN A 1 


similar to gb:Y00433 GLUTATHIONE PEROXIDASE (HUMAN); | 


qmS9b03.x1 Soaresjlacenta 8to9weeks 2NbHP8to9W Homo sapiens cDNA clone 1MAGE:1893965 3' 
similar to gb:Y00433 GLUTATHIONE PEROXIDASE (HUMAN); . | 


IS01855591F1 NIH MGC 57 Homo sapiens cDNA clone IMAGE:4075391 5' | 




lUROMODULIN PRECURSOR (TAMM-HORSFALL URINARY GLYCOPROTEIN) (THP) | 


lEARLYEIA 28 KD PROTEIN | 


Mus musculus pre-T cell receptor alpha gene, enhancer region and upstream region | 


i 

i 
IS 
1 

z 

.1 
i 

s 

o 

1 

? 

5 


|ALPHA-2-HS-GLYC0PR0TEIN PRECURSOR (FETUIN-A) | 


lyyOBalO.sl Scares melanocyte 2NbHM Homo sapiens cDNA clone IMAGE:270618 3' similar to gb:M55654 1 
TRANSCRIPTION INITIATION FACTOR TFIID (HUMAN); | 


|AU123530 NT2RM2HomosapienscDNAcloneNT2RM2000671 5' [ 


|Homo sapiens p22Dokdel (DOKDEL) mRNA, complete cds | 


bp Hit 

jfabase 
lource 










1- 
O 


1- 

O 




si 


li 


1 




O 


1 


1 






O 


1- 

O 






O 




? 


























a: 
1 












a: 








5 
i, 


5 






w' 




z 






SIMS 




1 i 














i 




i 


i 






i 






& 


HitAcession 




i 










i 


il 


i 


§ 


i 






1 


46782.1 1 


i 




1 




i 

1 


IO70159 1 






80527.1 1 






1 


§ 


1 
a 


1 

o 


0 




S 1 




1 


i 


§ 


< 


< 




< 


£ 






1 


i 






MostSimiiar 
(Top) Hit 
BiJ\STE 
Value 


2.3E+0CI 


2.3E+00I 


2.2E+0CI 


2.2E+0C| 


2.2E+0C 


? 


? 


¥ ' 


2.2E+Co| 


2.2E+00I 


2.2E+C0I 


2.2E+00| 


2.2E+00 


2.2E+0C 


2.2E+0CI 


\ 2.2E+CC1 


1 


1 2.2E+C0I 


1 


1 2.1E+CCI 


1 2.1E+C0I 


2.1E+CC 


2.1E+0CI 


? 


"signal 










10.08 




1 




8. ? 


1 51.561 






18 
























ORF SEQ 
ID NO: 


1 




23891 




24973 




i 


li 










1 


1 


27902 




1 281321 


1 






§ 


1 








19002 


19405 






i 




1 




11 


163441 




1 






1 


1 


1 


1 






§ 




i 


1 




















































Probe 
SEQ ID 
NO: 


93041 


1 


1 42161 


1 42161 


5276 


i 


i 


ii 


> S 


1 










7812 


7975 


1 


1 






8 


i 


i 





110 



/;/E:A^¥O01 57274 cpc; 

WO 01/57274 



PCTAJSOl/00666 



11 



^1 



SI 



II 



111 



/;/E:A^¥O01 57274 cpc; 

WO 01/57274 



PCTAJSOl/00666 





3 

I 

! 

& 




[RattUE norvegicus Actln-related protein complex 1b (Arpclb), mRNA ) 


|S01903309F1 NIH MGC 19 Homo sapiens cDNA clone IMAGE;4135586 5' | 


1 LEVANSUCRASE (BETA-D-FRUCTOFURANOSYL TRANSFERASE) (SUCROSE 6-FRUCTOSYL 1 
[TRANSFERASE) | 




|oz43h05.x1 Scares NhHMPu SI Homo sapiens cDNA clone IMAGE:1678137 3' | 


[levansucrase (beta-d-fructofuranosyl transferase) (sucrose 6-fructosyl 1 
[transferase) 1 


1 CM0-BT0282-1 71 299-1 27-e05 [3T0282 Homo sapiens cDNA | 


CM0-BT0282-1 71 299-1 27-e05 BT0282 Hcma sapiens cDNA | 


1 COUP TRANSCRIPTION FACTOR 1 (COUP-TF1 ) (COUP-TF 1) 1 


|601894255F1 NIH_MGC_17 Homo sapiens cDNA clone IMAGE:4140084 5' | 


HOMEOBOX PROTEIN DLX-3 


HOMEOBOX PROTEIN DLX-3 


|67B7 Human retina cDNA TspSOQI-cleaved sublibrary Homo sapiens cDNA not directional 1 


tu82d07.x1 NCI_CGAP_Gas4 Homo sapiens cDNA clone IMAGE:2257549 3' similar to contains MSR1 .t1 | 
MS R1 repetitive element ; | 


qfSObOl .x1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:1753417 3' similar to contains LI .t1 LI 1 
repetitve element; 




[ Homo sapiens small proline-rich protein (SPRR3) gene, exons 1 , 2, and 3 and complete cds | 


[ Mus musculus ST5GalNAclll gene, exon 2 1 




Zd25f01.r1 Soaresjetal heart_NbHH19W Homo sapiens cDNA clone IMAGE:341 689 5' similar to 
gb:D29805 N-ACETYLLACTOSAMINE SYNTHASE (HUMAN); 


[602185095T1 NIH MGC 45 Homo sapiens cDNA clone IMAGE:431 0591 3' | 


[Homo sapiens proliferation-associated SN F2-like protein (SMARCA6) mRNA, complete cds | 


[ Homo sapiens proliferation-associated SNF2-like protein (SMARCA6) mRNA, complete cds | 


1 

1 
1 


[Mus musculus ST5GalNAclll gene, exon 2 | 


[ Brachydanio rerio MHC class II DA-beta-2'01 gene, 3' end 


[RC0-GT0415-200700-032-C10 CT0415 Homo sapiens cDNA | 


Top Hit 


Source 




iNT 1 


|EST HUMAN | 


SWISSPROT 1 


NT 


lEST HUMAN 1 


SWISSPROT 1 


|EST HUMAN | 


[EST HUMAN | 


[SWISSPROT 1 


|EST HUMAN | 


[SWISSPROT 1 


[SWISSPROT 1 


[EST HUMAN 1 


EST HUMAN | 


EST HUMAN 1 


NT 


NT 1 


NT 


iNT 1 


1 

1 


[EST HUMAN ] 


iNT 1 


NT . 1 


iNT 1 


NT _ 


[NT 


Z 

D 

[3 


s 


No. 


i 

< 




|BF31 6805.1 1 


§ 
O 


|AL1 63280.2 1 


IAI141067.1 1 


§ 

o 


|BE06354S.1 | 


|BE063545.1 | 


|Q9TTR8 1 


IBF308000.1 1 


S 

i 




|W22424.1 1 


AI678443.1 | 


AI198573.1 


[AF1 99339.1 1 


[AF077374.1 | 


5 
>■ 


[X98373.1 1 


W58426.1 


[BF570077.1 | 


i 
< 


[AF155827.1 1 


[Y1 1344.1 1 


[Yl 1344.1 1 


1 


[BE697257.1 | 


Most Similar 
(Tod) Hit 


BLAST E 
Value 


.8E+Oo| 


.8E+00| 


f 


.7E+Oo| 


.7E+00| 


.7E+00| 


.7E+Oo| 


.7E+00| 


8 


.7E+00| 


.7E+00I 


.7E+00I 


.7E+00| 


.7E+00I 


.7E+Oo| 




? 


? 


I 


.6E+00 


1 


00+39- 


f 


1 


|00+39' 


.6E+00| 


00+39- 


|00+39- 


























































1 
1 


Signal 










1 3.25| 


1 1.021 


i 








1 1.291 


2.15 




[ 1.78| 




c\i 




1 3.61 1 








1 5.14| 


1 1.441 




1 3.141 




1 2.19| 


S 


ORF SEQ 


ID NO: 








i 

R 


1 220131 


1 221011 


1 




254251 


256731 




273551 


273561 


[__ 289771 




25185 


217751 




i 










s 


24657| 


1 




25317| 


Exon 


:0N 


S 

s 


19119 


1 


1 


12110 


12202 










1 


1 


1 


s 


19044 


i 


1 






12122 




13854 


14154 






§ 


15474 


1 


























































Probe 


NO: 


1 9428 


1 


1 


1 1092 


2225 


2321 




5448 


5448 


i 


1 70141 


7282 


1 72821 


1 8874 [ 




1 


1 


1997 


1 20031 


1 2238| 


»3 


i 


1 4255 


1 4255j 


1 50161 


5016 


1 


i 



112 



/;/E:A^¥O01 57274 cpc; 

WO 01/57274 



PCT/USOl/00666 



8 I 



113 



wo 01/57274 PCT/USOl/00666 




114 



/;/E:A^¥O01 57274 cpc; 

WO 01/57274 



PCT/USOl/00666 



Top Hit Descriptor 


1 M.mucedo gene encoding 4-Dihydromethyl-trisporate dehydrogenase | 


1 Cantharellus sp. partial 25S rRNA gene, isolate Tibet | 


Homo sapiens putative psihHbA pseudogene for hair keratin, exons 2 to 7 | 


1 Homo sapiens zinc finger protein 157 (HZF22) (ZNF1 57) mRNA 1 


[Homo sapiens zinc finger protein 157 (HZF22) (ZNF1 57) mRNA | 


1 Coix lacryma-jobi dihydrodipicoiinate synthase (dapA) gene, complete cds 1 




and MASP-related protein, complete cds | 


;601661233R1 NIH_MGC_/2 Homo sapiens cDNA clone IMAGE:3915945 3' | 


1 
E 

S 
1 

1 
1 

1 
1 


protein (P55), synaptic vesicle-associated integral membrane protein (VAMP-1 ), procollagen C-proteinase 
enhancer protein (PCOLCE) genes, complete c> 


Mus muscuius aipha-spectrin 1, erythroid (Spnal), mRNA | 


Plasmodium reichenowi partial 83/AMA-1 gene for apical membrane antigen 1 | 


1 Plasmodium reichenowi partial 83/AMA-1 gene for apical membrane antigen 1 1 


PM0-CT0289-291 ■ 99-004-f08 CT0289 Homo sapiens cDNA | 


PM0-CT0289-29r;99-004-f08 CT0289 Homo sapiens cDNA | 


□.melanogaster no-on-transient A gene product, complete cds | 


1 
1 
1 


I601657145R1 N1H_MGC_67 Homo sapiens cDNA clone IMAGE:3866195 3' | 


Homo sapiens GL004 protein (GL004), mRNA | 




S.alba phr-1 mRNA for photolyase 1 


S .alba phr-1 mRNA for photolyase | 


LYSOSOMAL ALPHA-MANNOSIDASE PRECURSOR (MANNOSIDASE, ALPHA B) (LYSOSOMAL ACID 1 
ALPHA-MANNOSIDASE) (LAMAN) | 




601657145R1 NIH_MGC_67 Homo sapiens cDNA clone IMAGE:3866195 3' | 






DIHYDROPYRIMIDINASE (DHPASE) (HYDANTOINASE) (DHP) 


MRNA 3'-END PROCESSING PROTEIN RNA1 5 | 


Top Hit 

Database 
Source 




1- 














EST HUMAN | 












EST HUMAN | 


EST HUMAN | 






EST HUMAN 1 










o 


EST HUMAN | 


EST_HUMAN | 






SWISSPROT 1 


SWISSPROT 1 


TopHitAcession 
No. 


IZ73640.1 1 


IAJ271 192.1 1 


|Y19213.1 1 


i 


45079981 




tAE002338.2 | 


AB030447.1 | 


IBE966735.2 | 


6755521 1 


AF016494.1 


6755621 1 






AW362834.1 | 


i 

1 


M33496.1 1 


! 


i 


i 


:AFC42084.1 | 


X72019.1 1 


X72019.1 1 


OC0754 1 


i 
1 


BE983379.2 1 


AE004392.1 | 




Q14117 1 


P25299 1 


Most Similar 
(Top) Hit 
BIJ\STE 
Value 


.3E+00| 


.3E+00 




.3E+00| 


|oo+3e- 


.3E+00| 




.3E+Oo| 


.3E+00| 


1 






1 


1 


8 


.3E+00 


.3E+00| 




1 


.3E+00 


§ 


.3E+00 


.3E+00| 


.3E+00| 










1 
































































Expression 
Signal 












s 


§ 

(Vi 








P 




s. 
















? 




















ORFSEQ 
ID NO: 




1 206591 




1 210331 


1 210341 










1 226321 




1 


1 246071 




1 256741 


1 256751 




i 








1 275601 


1 


27639 


1 












Exon 
SEQID 
NO: 


1 


§ 










? 


i 




12835 


1 


1 


1 


14839 




15576 


15770 


1 






17350 


17357 


17357 


17425 






17716 




































































Probe 
SEQ ID 
NO: 




i 








i 




1 21961 


2503 


1 


3547 


1 45021 


i 


4964 


1 


i 


i 


1 


i 


1 


7480 


1 7487 1 


7487 


7574 




1 76751 


1 


1 78741 


8 


i 
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Top Hit Descriptor 


Mus musculus desmin gene | 


xp09e03j(1 NCI_CGAP_HN9 Homo sapiens oDNA clone IMAGE:2739868 3- | 


Human mRNA for KIAA0085 gene, partial cds | 


Bacillus subtilis genomic DNA 23.9kB fragment | 


i 
i 

c 
O 

1 
.1 

t 

1 
1 

a 
1 

o 


i 

1 
s 

1 

i 
.1 
§ 

1 
i 


E1 GLYCOPROTEIN PRECURSOR (MATRIX GLYCOPROTEIN) (MEMBRANE GLYCOPROTEIN) | 


? 

o 

3 
1 

g 
1 

i 

i 

z 

1 
i 
1 

1 
1 


HiSTIDiNE-RICH PROTEIN PRECURSOR (CLONE PFHRP-III) | 


HISTIDINE-RICH PROTEIN PRECURSOR (CLONE PFHRP-III) 


HISTIDINE-RICH PROTEIN PRECURSOR (CLONE PFHRP-III) | 


Homo sapiens hypottieScal protein PRO3077 (PRO3077), mRNA | 


Eiaeis oleifera sesquiterpene synthase mRNA, complete cds | 


pea seed-borne mosaic virus complete genome | 




Homo sapiens G-protein coupled receptor 1 4 (GPR14) gene, complete cds | 


Homo sapiens mRNA for K1AA0874 protein, partial cds | 


Arabidopsis thaliana DNA chromosome 4, contig fragment No. 63 | 


Arabidopsis thaliana DNA chromosome 4, contig fragment No. 63 1 


CONJUGAL TRANSFER PROTEIN TRBE PRECURSOR | 


Homo sapiens LHX3 gene, intron 2 | 


1 G.gallus T-cadherin mRNA, complete cds | 


Mus musculus subtilisin-like serine protease LPC (PC7) gene, exons 1 to 9, partial cds 1 


MRO-FT0175-050900-203-g06 1 FT0175 Homo sapiens cDNA | 


i 

5 
i 
1 
1 


Rattus norvegicus Glycine receptor alpha 2 subunit (glycine receptor, neonatal) (Glra2), mRNA | 


Rattus rattus cardiac AE3 gene, exons 1-23 | 


Arabidopsis thaliana DNA chromosome 4, contig fragment No. 21 ( 




T.pinnatum chloroplast rbcL gene, partial | 


G.gallus T-cadherin mRNA, complete cds, | 


MR3-ST01 91 -1 40200-01 3-c05 STC191 Homo sapiens cDNA | 


D.hydei ayl repeat cluster DNA, fragment D [ 


< 

I 
1 

i 

i 


Top Hit 

Database 
Source 




lEST HUMAN | 








|EST HUMAN ] 


ISWISSPROT 1 


|EST HUMAN | 


ISWISSPROT 1 


ISWISSPROT 1 


ISWISSPROT 1 


















ISWISSPROT 1 








[EST HUMAN ] 
















]EST HUMAN | 




lEST HUMAN | 


TopHitAcession 
No. 




|AV(/ 274791.1 1 


ID42042.1 1 




IAF187873.1 | 


|BF348043.1 | 


i 


IAA676246.1 | 


IP05228 1 


IP05228 1 


|P05228 1 


8924234] 


IAF080245.2 J 


1AJ252242.1 1 


|AJ252242.1 1 


IAF140631.1 1 


i 
1 


IAL161563.2 | 


i 
i 


IP54910 1 


IAF188740.1 1 


]M81779.1 1 


IU75902.1 1 


]BF373570.1 j 


JAF188740.1 1 


§ 

i 


i 

i 


i 




IY09200.1 1 


i 


JAW813276.1 1 


1 
X 


]BE003113.1 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 


.3E+00 




.3E+00| 


.3E+00| 


1 




§ 


.2E+00| 


g 


.2E+00| 








.2E+00| 




.2E+00| 


.2E+00| 




.2E+00l 


g 


.2E+00] 


















.2E+00| 


.2E+00] 










































































11 
|s 


^ 












§. 
















F: 














i 






q 










? 




^ 




















ORF SEQ 
ID NO: 


1 28424] 




1 28923 1 










1 203841 


1 20578 1 




1 20580 1 




1 20901 1 


209431 


§ 






I 22851 1 






23038] 




23350] 


S 
?3 


i 


] 23981 ] 






1 24145 








S 


1 25783] 


Exon 
SEQ ID 
NO: 


18178 


18510 




18705 


i 


19139 






10735 




10735 


1 


11058 


11097 


110971 


1 


1 


13053 






13232 


1 




13814 


i 




1 




s 








15647 








































































Probe 
SEQ ID 
NO: 


i 


1 


i 


1 


i 


9533 


1 


i 


1 


s 






1145 


1 11871 


1187 


1 




3128 


1 


1 


3311 




1 3S50| 


i 


1 42011 


1 


4371 


4422 


4460 




i 




1 57391 


i 



116 



wo 01/57274 PCTAJSOl/00666 




wo 01/57274 



PCT/USOl/00666 




/;/E:A^¥O01 57274 cpc; 

WO 01/57274 



PCT/USOl/00666 





i 

j 

\ 


Mus musculus Serfl protein (Serfl), survival of motor neuron protein (Smn), neuronal apoptosis intiibitory 
protein-rs6 (Naip-rs6). and neuronal apoptosis inhibitory protein-rs3 (Naip-rs3) genes, complete cds 


3-OXO-5-ALPHA-STEROID 4-DEHYDROGENASE 1 (STEROID 5-ALPHA-REDUCTASE 1) (SR TYPE 1) ( 


3-OXO-5-ALPHA-STEROID 4-DEHYDROGENASE 1 (STEROID 5-ALPHA-REDUCTASE 1) (SR TYPE 1) | 


HYPOTHETICAL 67.9 KD PROTEIN C6F1 2.08C IN CHROMOSOME 1 | 


af26g08.s1 SoaresJotaLfetus_Nb2HF8_9v/ Homo sapiens cDNA clone IMAGE:1032830 3' similar to 1 
WP:C42D8.3 CE04204 ;contains element MER22 MER22 repetitive element ; | 


Rattus norvegicus neuromedin U precursor (NmU) gene, exons 5 and 6 ( 




Agaricus bispcrus mRNA for tyrosinase | 


Homo sapiens calcium channel alphalE subunit(CACNAIE) gene, exons 7-49, and partial cds, alternatively 1 
spliced 1 


Homo sapiens hypothetical protein FU10139 (FLJ10139), mRNA | 


Taenia ovis 45W antigen (ToW4) gene, complete cds | 




Bacillus subtilis 42.7kB DNA fragmentfrom yvsA to yvqA | 


Hordeum vulgare gene encoding cysteine proteinase | 


{bos taurus micromolar calcium activated neutral protease 1 (CAPN1) gene, exons 11-20. and partial cds | 


Bos taurus micromolar calcium activated neutral protease 1 (CAPN1) gene, exons 11-20, and partial cds | 


z 

s 


0 

s 
1 

z 

.1 

i 

X 

1 

0 
5 

i 
1 


Mus musculus subti lsin-llke serine protease LPC (PG7) gene, exons 1 to 9. partial cds | 


B-CELL RECEPTOR CD22 PRECURSOR (LEU-14) (B-LYMPHOCYTE CELL ADHESION MOLECULE) 1 
(BL-CAM) 1 




ac79b08.s1 Stratagene lung (#937210) Homo sapiens cDNA clone IMAGE:868791 3' | 


601443950F1 NIH_MGC_65 Homo sapiens cDNA clone IMAGE:3848005 5' | 


601443950F'^ NIH MGC_65 Homo sspiens cDNA clone IMAGE:3848005 5' | 




Top Hit 


Source 




SWISSPROT 1 


SWISSPROT 1 


SWISSPROT 1 


EST HUMAN | 






z 


z 


z 




z 








z 


SWISSPROT 1 


EST HUMAN | 


z 


SWISSPROT 1 




EST_HUMAN | 


EST HUMAN | 


EST HUMAN | 




TopHitAcession 




AF131205.1 


P24008 


P24008 


i 

o 


AA628453.1 | 


AF222761.1 | 


U23S08.1 1 


i 

< 


AF223391.1 | 






D10852.1 


AJ223978.1 | 


Z97022.1 1 


AF248054.1 


AF248054.1 | 


P04501 1 


AW452782.1 | 


U75902.1 1 


P20273 1 


AF1 92531.1 1 


AA775191.1 1 


BE868267.1 | 




D1 0852.1 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 




1 


1 1. OE+OO 


1 


1. OE+OO 1 


1 


8 


1. OE+OO 1 


1 


1.0E+00I 


i 


00+30' P 


1. OE+OO 1 


I.OE+OOl 


1 1. OE+OO 1 


1. OE+OO 1 


I.OE+OOl 


00+301. 1 


I.OE+OOl 


I.OE+Ooj 


I.OE+OOl 


I.OE+OOl 


00+301. 1 


I 


I.OE+OOl 


§ 

1 - 




1 




1 








? 














5 




^ 










i 








ORF SEQ 
ID NO: 


22408 




1 




i 






233281 










_ 247771 


1 


1 


25553! 


Si 


257301 






i 




26713 1 






Exon 
SEQ ID 
NO: 








i 


i 






1 






1 




1 




i 




S 


i 








i 


i 


1 




Probe 
SEQ ID 
NO: 


8 






i 




3327 


1 


1 


s 


4178 


4815 




51391 


5228 


i 


1 






i 


i 


65371 


i 


1 


i 


6734 1 
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1 

i 


PEROXISOMAL HYDRATASE-DEHYDROGENASE-EPIMERASE (HDE) (MULTIFUNCTIONAL BETA- 
OXIDATION PROTEIN) (MFP) [INCLUDES: 2-ENOYL-COA HYDRATASE ; D-3-HYDR0XYACYL COA 
DEHYDROGENASE ] 


PEROXISOMAL HYDRATASE-DEHYDROGENASE-EPIMERASE (HDE) (MULTIFUNCTIONAL BETA- 
OXIDATION PROTEIN) (MFP) [INCLUDES: 2-ENOYL-COA HYDRATASE ; D-3-HYDR0XYACYL COA 
DEHYDROGENASE] 


RC1 -HT0229-1 81 099-01 1-e06 HT0229 Homo sapiens cDNA | 


Human immunodeficiency virus type 1 (HIV-1), Isolate SF33, | 


601497581F1 NiH_MGC_70 Homo sapiens cDNA clone IMAGE:3899421 5' | 


Mus musculus chloride channel calcium activated 1 (CIcal), mRNA | 


Mus musculus chloride channel calcium activated 1 (CIcal), mRNA | 


AV68S554 GKC Homo sapiens cDNA clone GKCCYA1 15' ' | 


1 
■5. 

i 

O 

1 

1 

o 

1 
1 

! 

i 


Xenopus laevis zona pellucida C gli^oprotein precursor (xlZPC) mRNA, complete cds 


5 

<. 

5 

i 

i 

.1 
§• 

0 

5 


i 

1 
< 

a 

j 

X 

i 

.1 

i 
1 

1 


o 

i 

1 
< 

a 

i 

! 

I 

i' 
% 
1 


THROMBOMODULIN PRECURSOR (FETOMODULIN) (TM) | 


EST388293 MAGE resequences, MAGN Homo sapiens cDNA | 


Homo sapiens mRNA for KIAA1517 protein, partial cds | 


Drosophila melanogaster regulator of G-protein signalling LOCO III mRNA, complete cds | 


Drosophila melanogaster regulator of G-protein signalling LOCO III mRNA, complete cds | 


Homo sapiens chromosome 21 segment HS21 CI 02 | 


Apple mosaic virus RNA 2 putaHve polymerase gene, complete cds | 


SERINEiTHREONINE PROTEIN KINASE MINIBRAIN | 




B2 BRADYKININ RECEPTOR (BK-2 RECEPTOR) | 


Danio rerio mRNA for Eph-like receptor tyrosine kinase rtkS | 


AMINO-ACID ACETYLTRANSFERASE (N-ACETYLGLUTAMATE SYNTHASE) (AGS) (NAGS) | 


601 653583R2NIH_MGC_55 Homo sapiens cDNA clone IMAGE:3838461 3' 1 


601653583R2 NIH_MGC_55 Homo sapiens cDNA clone IMAGE:3838461 3' | 


lii<e protein, isolate JM983 | 


Top Hit 
Database 
Source 


SWISSPROT 


SWISSPROT 


^1 


NT 1 


EST HUMAN | 


NT J 


NT 1 


EST HUMAN 1 


1 


NT 


ESTHUIIMN 1 


i 


EST HUMAN | 


SWISSPROT 1 


EST HUMAN 1 


NT 


NT 1 




NT 1 


NT . 1 


SWISSPROT 1 


NT 1 


SWISSPROT 1 


NT _ 


SWISSPROT 1 


EST HUMAN 1 


EST HUMAN | 




Top Hit Acession 


No. 


Q02207 


Q02207 


BE147331,1 1 


M38427.1 1 


BE907592.1 | 


1 


67534291 


AV689554.1 1 


U44952.1 1 


U44952.1 1 


AV758825.1 1 


AA004982.1 1 


AA004982.1 | 


P15306 1 


AW976184.1 1 




AF24545S.1 | 


AF245455.1 1 


AL1 63302.2 | 


1 
< 


P49657 1 


U65667.1 1 


i 

a 


i 
3 


P22567 


BE957439.2 1 


BE957439.2 | 


AJ302158.1 1 


Most Similar 
(Top) Hit 


BLAST E 
Value 


1. OE+OO 


.OE+OO 


OE+OO 


OE+OO 


OE+OO 


OE+OO 


OE+OO 


00+30- 


OE+OO 


OE+OO 


1 


OE+OO 


OE+OO 


OE+OO 


OE+OO 


OE+OO 


9.9E-01 


9.9E-01 


9.9E-01 


i 


9.9E-01 


9.9E-01 


9.SE-01 


S 


9.SE-01 


9.8E-01 


9.8E-01 


9.8E-01 
































i 
1 








2.01 1 








^ 








2.83 1 


17.C8| 


17.081 




1.51 1 






d 














2.32 








ORFSEQ 


ID NO: 










27405| 


1 


27527J 


276071 




276101 


280051 


280751 


1 












8 




i 








20265J 




23430 1 




Exon 
SEQ ID 
NO: 








1 
















7835 






























3644 
























































Probe 
SEQ ID 
NO: 


i 


i 


& 
g 


7064 


7339 


8 


1 


7544 


7546 1 


7646 


79161 


e 


s 


i 




1 




15531 


i 


35531 


5461 1 




7510 


i 




3732 


3732 


6272 
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Top Hit Descriptor 


like protein, isolate JM983 | 


601 1 10258F1 NIH MGC 16 Homo sapiens cDNA clone IMAGE:3350750 5' | 


601 1 10258F1 NIH_MGC_16 Homo sapiens cDNA clone IMAGE:3350750 5' 1 




we52e04.x1 Soares_thymus_NHFTh Homo sapiens cDNA clone IMAGE:2345694 3' | 


and e, partial cds ( 


Triticum aestivum stripe rust resistance protein Yrl 0 (Yrl 0) gene, complete cds 1 




i 

1 

G 
5 

< 

i 
1 

i 

o 
o 

i 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen deprivation | 


Bromus inermis putative cytosolic phosphoglucomutase (pgml ) mRNA, complete cds 1 


PM2-UM0053-240300-005-f12 UM0053 Homo sapiens cDNA 


Homo sapiens K1AA091 4 gene product (K1AA0914), mRNA 1 


Parvowrus B1 9 DNA, patient C, genome position 2448-2994 | 


Parvovirus B19 DNA, patient G, genome position 2448-2994 1 


P falciparum complete gene map of plastid-like DNA (IR-A) | 


i 

Q 

z 

1 

< 

i 
I 
1 
1 
1 
1 


AV752505 NPD Homo sapiens cDNA clone NPDBAG06 5' 1 


protein, partial cds 


501675S39F1 NIH MGC 21 Homo sapiens cDNA clone IMAGE:395B473 5' | 


601675S39F1 NIH MGC 21 Homo sapiens cDNA clone IMAGE:3958473 5' | 


RC1-CT0295-241 199-01 1-b02 CT0295 Homo sapiens cDNA | 




Pimpinella brachycarpa zinc finger protein (ZFP1) mRNA, complete cds | 


601466703F1 NIH_MGC_67 Homo sapiens cDNA clone IIWIAGE:3869929 5' ) 


homolog) (EGFR), mRNA ) 


Homo sapiens phytanoyl-CoA hydroxylase (PHYH) gene, exon 5 j 


III 






i 












i 






















i 




i 






1 






















































































































TopHitAcession 
No. 


|aJ302158.1 


IBE258705.1 | 


IBE258705.1 1 


U52111.2 


1 


|u26716.1 1 


IAFI 491 12.1 1 


^90544.1 1 


|BF51 1209.1 1 


IAL114281.1 1 


AF197925.1 j 


AW799674.1 


L 76523751 


IZ70556.1 1 


IZ70556.1 1 


S 


|AV752605.1 1 


|AV752605.1 1 


U91423.1 


|BE902340.1 | 


IBE902340.1 | 


|AW861 102.1 ] 


i 


IAF080595.1 ] 


|BE781251.1 1 


1 


|AF242382.1 j 


Similar 
P) Hit 
iSTE 


i 


5.8E-01 


3 


9 


9 


9 


9 


9 


9 


9 


9 = 
S c 


' 9 

; S 


i 


9 


i 


i 


9 


9 


i 


i 


i 


i 


i 


i 


i 


i 


9 


























































Expression 
Signal 


















































s 






:F SEQ 
3 NO: 


26293 


28487| 








1 




27025] 






24016 


24036 






254991 






289131 


i 




i 


1 












O 
























































Exon 
SEQ ID 
NO: 




1 


1 










i 


18382 




142341 


1 S 


1 


15435] 


154351 


i 


1 


S 






13627] 


17143 




' 13106] 


s 




1 


Probe 
SEQID 
NO: 


s 


i 


i 




5106 


i 


i 


i 


1 


1 






i 




5517 


i 


i 


1 




1 37151 






3165 




i 




1 
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Top Hit Descriptor 


|RC5-BT0503-271199-011-B01 BT0503 Homo sapiens cDNA | 




Bovine papillomavirus type 2, complete genome | 


Equus caballus microsatellite LEX01 3 ( 


Homo sapiens nuclear factor of kappa light polypeptide gene enhancer in B-cdls 1 (NFKB1 ) gene, complete 1 




oe09b03.s1 NCI CGAP Ov2 Homo sapiens cDNA clone IMAGE:1385357 | 


Aedes trisefiatus putative large subunit ribosomal protein rpL34 mRNA, complete cds | 


Homo sapiens chromosome Xq28 melanoma antigen family A2a (MAGEA2A), melanoma antigen family A12 
j(MAGEA12), melanoma antigen family A2b (MAGEA2B), mdanoma antigen family A3 (MAGEA3), caltractin 
(CALT), NAD(P)H dehydrogenase-like protein (NSDHL), and Ll> 


60 1 441 338T1 N 1 H_MGC_72 Homo sapiens cDNA clone IMAGE:391 61 84 3' | 




:Mus musculus carbonic anhydrase 4 (Gar4), mRNA 1 


7o58e06.x1 NCLCGAP_Kid1 1 Homo sapiens cDNA clone IMAGE:3578219 3' similar to SW:NU5M_TRYBB 
P04540 NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5 ; 


601820312F1 NIH_MGCJ8 Homo sapiens cDNA clone IMAGE:4052018 5' | 


6021 54759F1 NIH MGC 83 Homo sapiens cDNA clone IMAGE:4295896 5' | 


< 
E 

t 

1 

3 

1 
1 

1 
1 


Lb200G8R Infant brain, LLNL array of Dr. M. Soares 1 NIB Homo sapiens cDNA clone LLAB200G8 5' | 


1 AB200G8R Infant brain, LLNL an-ay of Dr. M. Soares 1 NIB Homo sapiens cDNA clone LLAB200G8 5' | 


Rattus norvegicus mucin (MUC2) gene, partial cds | 


INTERV^LPHA-TRYPSJN INHIBITOR HEAVY CHAIN H3 PRECURSOR (ITI HEAVY CHAIN H3) | 


CO 

E 
1 
0 

§ 
■§ 

z 

■1 
S 

1 
0 

CD 

1 
0 


Rattus norvegicus Rab3 GDP/GTP exchange protdn mRNA, complete cds | 


Homo sapiens uncoupling protein-3 (UCP3) gene, complete cds | 


Homo sapiens DKFZP564M2423 protein (DKFZP554M2423}, mRNA | 


Homo sapiens neurexin Ill-alpha gene, partial cds | 


Oryctolagus cuniculus Rad51 (RAD51) mRNA, complete cds | 


Xenopus laevis gene for alddase, complete cds | 


Top Hit 

Datalase 
Source 


[EST HUMAN | 












i 






lEST HUMAN 1 


|EST HUMAN | 




EST HUMAN 


lEST_HUMAN 1 


[EST HUMAN | 




EST HUMAN | 


EST HUMAN | 




ISWISSPROT 1 


EST HUMAN | 














Top HitAcession 
No. 


|BE071 172.1 1 


IM20219.1 1 


|M20219.1 1 


IAF075615.1 1 


|aF21 3884.1 


i 


IAA847040.1 | 




U82671.2 


|BE622702.1 1 


IBF037586.1 | 


1 6671677 


BF593251.1 


|BF1324a2.1 1 


IBF680047.1 | 


1 89230561 


T26418.1 1 


T26418.1 1 


IU68172.1 1 


a 


i 


IU72995.1 1 


IAF050113.1 1 


I 76616251 


jAFC99810.1 1 


|AF01 7729.1 1 


g 

s 


Most Similar 
(Top) Hit 
BLAST E 
Value 




1 9.3E-01I 


3 


1 9.3E-01 1 


1 9.3E-01 


1 9.3E-01 1 


1 9.3E-01I 


1 9.3E-01 1 


9.3E-01 


1 9.2E-01| 


1 9.2E-01 1 


1 9.2E-0lj 


9.2E-01 


1 9.2E-01| 


1 9.2E-01I 


1 9.1E-01| 


9.1E-01 


9.IE-O1I 


1 9.IE-OII 


1 9.1E-Q1I 


1 9.1E-011 


1 9.1E-01I 


1 9.1E-01I 


I 9.OE-OII 


1 9.0E-01 1 


9 


1 9.0E-01I 


O TO 

1* 






















^ 




















1 














ORF SEQ 
ID NO: 




1 


1 23631 1 




25410 












\ 25S4.I 


ii 


3 ? 




1 291011 








i 


§ 


§ 


s 




1 229001 




_ 245891 




Exon 
SEQ ID 
NO; 


1 




13857j 


S 


15354 


s 


1 


s 








If 


? 1 


1 


i 


11967 


§ 


i 


14207 






i 


i 




14181 


i 


s 
























































Probe 
SEQID 
NO; 




1 


i 


1 5155| 


i 


i 


1 67011 


i 


9954 


i 


1 




7952 


i 




2077 






4310 


1 


6472 


1 


i 


1 31701 


4282 


1 
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Top Hit Descriptor 


Fugu ruhripes neural cell adhesion molecule LI homolog (LI -CAM) gene, complete cds; putative protein 1 
'complete cds; and calcium channel alpha-1 subunit> 


iRabbitf^C fragment RLA-DF DNA | 


IXyiellafastidiosa, section 90 of 229 of the complete genome | 


Chlamydophlla pneumoniae AR39, section 21 of 94 of the complete genome | 


i 

Q 

< 
1 

9 

i 


|M.aerugi,nosa (HUB 5-2-4) DNA from plasmid PMA1 | 


Synechocystis sp. PCC68C3 complete genome, 13/27, 1576593-1719643 1 


Homo sapiens SCSI (SCSI) gene, partial cds | 


IHomo sapiens AT-binding transcription factor 1 (ATBF1). mRNA | 


'nn05f11.s1 NCLCGAP_Pr4.1 Homo sapiens cDNA clone IMAGE:1076877 ' | 


Pseudomonas aeruginosa topoisomerase (top), putative transcriptional regulatory protein OhbR (ohbR), ortho- 

'dioxvgenase alpha-ISP protein OhbB (ohbB), and put> 




,QV0-NN1021-100800-337-c03NN1021 Homo sapiens cDNA | 


I601823584R1 NIH MGC 79 Homo sapiens cDNA clone IMAGE:4043564 3' | 


601823684R1 N1H_MGC_79 Homo sapiens cDNA clone IMAGE:4043564 3' 


,AV661898 GLC Homo saptens cDNA clone GLCGYG07 3' | 


1 Rat IGFII gene for insulin-like growth factor II | 


O 
1 

1 

1 

1 
5 

1 
I 

1 




Arabidopsis thaliana DNA chromosome 4. contig fragment No. 65 | 


Drosophila melanogaster merlin (Dmerlin) mRNA. complete cds | 


1 
1 
1 

s 


Chicken lipoprotein lipase gene | 










Cyanidium caldarium gene for SigC, complete cds | 


Cyanidium caldarium gene for SigC, complete cds | 


Top Hit 
Database 
Source 


z 


h- 
z 


iNT 1 


NT 


1 
1 

i 


iNT 1 


NT 


iNT 1 


INT 1 


[EST HUMAN 1 


IN 


INT 1 


1 

I 

s 


1 

Z> 
X 


lEST HUMAN 1 


lEST HUMAN 1 


z 


lEST HUMAN 1 


z 


z 


iNT 1 


z 


iNT 1 


NT 


z 


iNT 1 


|EST HUMAN | 


NT 


NT 


Top Hit Acession 


No. 


i 
< 


1 


|AE003944,1 | 


(AE002186.2 | 


IO26350 1 


1228337.1 1 


i 


|AF1 06953.2 1 


1 59018931 


i 














1X17012.1 1 


1 


4503210 


lALI 61 565.2 | 


IU49724.1 1 


1X50547.1 1 


1X60547.1 1 


|AF1 43732.1 1 




|AP001518,1 1 


i 


i 


i 
1 


1 


1 
1 




|BF107694.1 


i 


i 
1 


Most Similar 
(Top) Hit 


BLAST E 
Value 


i 


1 8.9E-01 


1 8.9E-01 


1 8.9E-01 


1 8.8E-01 


1 8.8E-01 


i 


1 8.7E-01 


1 8.7E-01 


1 8.7E-01 


8.7E-01 


1 8.7E-01 


1 8.7E-01 


1 8.7E-01 


1 8.7E-01 


1 8.7E-01 


i 


1 8.6E-01 


8.6E-01 


1 8.6E-01 


1 8.6E-01 


i 


S 


1 8.6E-01 


i 


1 8.6E-01 


1 8.5E-01 


1 8.5E-01 


1 8.5E-01 


§ 

1 


Signal ■ 


2.08 










§ 
CM 










251 




5.47 






2.78 






1.01 










1 2.14| 






1 2.75| 






ORF SEQ 


ID NO: 


25476 




S 












1 221 30 1 






i 


28343 












22012I 




1 23427| 






s 






1 


1 2801 3 1 


i 


Exon 
SEQ ID 
NO: 


s 












197221 




12233 








1 






19534 




I0767| 


12109 




13642 


1 


i 


15934 








I7774| 






























































Probe 
SEQ ID 
NO: 


5494 


5786 


1 


i 


1 


1 84301 


9108 




2353 


1 


1 


1 76131 


1 


8970 






1 




2224 




1 37301 


i 


5595 


§ 


1 


i 


6443 


1 7924| 
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Top Hit Descriptor 






Mus musculus mper1 gene for periodi , complete cds | 


1 Mus musculus NK cell receptor 2B4 gene, promoter region and partial cds 1 


I Human fibroblast growth factor receptor 3 (FGFR3) gene, Intron 7 | 


1 Human fibroblast growth factor receptor 3 (FGFR3) gene, intron 7 | 


Pyrococcus abyssi complete genome; segment 5/6 | 








1 

i 

1 
f 

n 


11 DRB pseudogene DRB2 | 


II DRB pseudogene DRB2 1 


m 
□ 

i 

1 
§ 


Q 

i 

1 
i 




nn01f12.y5 NCLCGAP_Co9 Homo sapiens cDNA clone IMAGE:1076495 5' similar to contains THR.tl THR 1 
repetitive element; 


Drosophila melanogaster Lisl homolog mRNA, complete cds ( 


Mus musculus neuro-d4 gene, exons 3 through 12 and partial cds | 


'complete genome 


Phytophthora infestans mitochondrion, complete genome 1 




Rattus norvegicus mRNA for RPHO-1 , complete cds 1 




, 1L3-CT021 9-1611 99-031-C08 CT021 9 Homo sapiens cDNA | 


Rattus norvegicus mRNA for RPHO-1 , complete cds | 


CM4-HT0243-081 1 99-037-e01 HT0243 Homo sapiens cDNA | 






0 
> 

0 
EC 
Q 

i 

I 

i 

i 

1 


Streptomyces anSbioticus polyketi 


1 

3 
a 

i 

1 

1 

1 


|Canls familiaris MHC DU Class 1 


1 Canis familiaris MHC DLA Class 1 


Canis familiaris WHC DLA Class 1 


Top Hit 
Database 
Source 


NT 


iNT 1 


iNT 1 


NT 


iNT 1 


iNT 1 


INT 1 


INT 1 


INT 1 




INT 1 


iNT 1 


iNT _ 1 


INT 1 


iNT 1 




EST HUMAN 


NT 


iNT . 


H 


Lnt 




NT 


iNT 1 


]EST HUMAN | 


INT 1 


]EST HUMAN | 


NT 1 


1 IN] 


] SWISSPROT 1 


Top HitAcession 


No. 


i 


1 95070081 


i 
1 


|AF143509.1 


IL78726.1 1 


i 


i 


IM93437.1 1 


|AL1 61 506.2 1 


1 
1 




1 


1 


1 


1 




AI791952.1 


|AF098070.1 1 


|AF1 081 33.1 1 


|aE000903.1 1 


1 72124721 


£ 
< 


|AB000489.1 1 


IAF145589.1 | 


i 

i 


|AB000489,1 1 


IAW379433.1 | 


i 


IAFC52S59.1 | 


IQ9JI70 1 


Most Similar 
(Top) Hit 


BLAST E 
Value 


1 8.5E-01 


1 8.5E-01 


S 


1 8.4E-01 


1 8.4E-01 


1 8.4E-01 


i 


1 8.3E-01 


1 8.3E-01 


9 

s 


1 8.3E-01 


s 


9 

2 


1 8.3E-01 


1 8.3E-01 


1 8.3E-01 


1 8.3E-01 


1 8.3E-01 


1 8.3E-01 


1 83E-01 


1 8.3E-01 


1 8.3E-01 


1 8.2E-01 


1 8.2E-01 


1 8.2E-01 


] 8.2E-01 


i 


1 8.2E-01 


] 8.2E-01 


] 8.2E-01 


Expression 


Signal 


eg 


1 2.631 




d 


































1 2.46| 




1 2.99| 


1 1.51] 






1 3.26] 


1 3.58| 






ORFSEQ 


ID NO: 








1 237851 


1 251 45| 


1 251 46| 




i 


1 22774| 


1 23434| 


i 


24570 1 


24571 1 


1 245701 


1 24571 1 






27914] 




28204I 




§ 


1 21789] 






1 


] 26154] 




] 27871 ] 




Exon 
SEQID 
NO: 


i 








19443 






99901 




i 


i 


1 






















i 


i 






§ 


19462 


17638 


17787 






























































Probe 
SEQ ID 
NO: 


i 


i 




1 41081 


R 


1 5378 1 


7721 


724 




3739 


i 


1 4917| 


1 4917| 


5177 


5177 


1 52221 


7573 


1 78221 


1 7867 1 


1 8O62I 


1 




g 


2043 


1 


5045 


1 6122] 




7788 


1 



124 



wo 01/57274 PCT/USOl/00666 




125 



/;/E:A^¥O01 57274 cpc; 

WO 01/57274 



PCTAJSOl/00666 



Top Hit Descriptor 


1 Human insulin receptor (allele 2) gene, exons 14, 15, 16 and 17 1 


1 P. sativum GR gene | 


1 Glardia lamblia variant-specific surface protein GSM-B (vspG3M-B) mRNA, partial cds | 


ISMALL HYDROPHOBIC PROTEIN | 


1 Homo sapiens K1AA1 072 protein (KIAA1072), mRNA | 


|NEUR/i.L-CADHERIN PRECURSOR (N-CADHERIN) | 


1 HSC1 KH041 normalized infant brain cDNA Homo sapiens cDNA clone o-l kh04 | 






1 Sphenodon punctatus alpha enolase mRNA, partial cds 1 


D.discoideum racGAP gene | 


Arabidopsis thaliana 1-amino-1-cyclopropanecarboxvlate synltiase (ACS5) gene, complete cds | 




butyrophilin^like (NG9), butyrophilin-li> 


CITRATE SYNTHASE | 


(GalNAo-T7) (GALNAC-T7), mRNA 


iHomo sapiens PR01975 mRNA, complete cds 1 


Coturnix coturnix japonica sub-species japonica beta-actin mRNA, partial cds | 


] Cotumix coturnix japonica sub-species japonica beta-actin mRNA, partial cds | 


]RAFFINOSE INVERTASE (INVERTASE) | 


] RAFFINOSE INVERTASE (INVERTASE) | 


]yf24b02.s1 Soaresfetd liver spleen IN FLS Homo sapiens cDNA clone IMAGE:1 27755 3' | 


Lycopersicon hirsutum ADP-glucose pyrophosphorylase large subunit (AGP-L1) mRNA, complete cds | 






Arabidopsis thaliana 3-methylcrotonyl-CoA carboxylase non-biotinylated subunit (MCCB) mRNA, complete 1 


Top Hit 
Database 
Source 


NT 


iNT 1 




|S\^1SSPR0T 1 


|NT 1 


|S\^ISSPROT 1 


lEST HUMAN J 


< 
5 


iNT 1 




iNT 1 




u 


z 


O 

i 








!nt 1 


]SWlSSPROT 1 


]SWlSSPROT ] 






NT 






TopHitAcession 
No. 


IM29930.1 1 








1 


i 




IAW959567.1 | 


|U87305.1 1 




|Y10159.1 1 


i 


|aF1 84345.1 1 


AF050157.1 


i 




1 
< 


1 

< 


i 
< 


IP16553 ] 


|P16553 ] 


i 


|aF184345.1 | 


1 11497621] 


AF059510.1 


|aF059510.1 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 7.9E-01 


1 7.9E-01 


1 7.9E-01 


9 


1 7.9E-01 


1 7.9E-01 


1 7.8E-01 


1 7.8E-01 


1 7.8E-01 




9 


1 7.8E-01 


7.7E-01 


7.7E-01 


1 7.7E-01 


7.7E-01 


1 7.7E-01 


j 7.7E-01 


] 7.7E-01 


] 7.7E-01 


] 7.7E-01 


] 7.7E-01 


7.7E-01 


] 7.7E-01 


7.6E-01 


9 


Signal 






1 4.78| 




1 2.361 


1 2.481 


1 1.75| 




1 0.79| 










1.85 
















1 1.95] 




1 6.14] 






ORF SEQ 
ID NO: 




§ 


1 27582j 


i 








1 220161 


24278] 




27399 1 








1 




23259] 




23980] 






] 25630] 






25720 


Si 


Exon 
SEQ ID 
NO: 


s 


1 


17373 


i 




18415 




12114 


1 


i 


i 




10112 


S 




13234 


1 


14195 


14195 






15541 


10112 




























































. Probe 
SEQ ID 
NO: 


1 50841 


1 6721 1 


1 7504 1 


7781 


1 8372| 


i 


i 




1 


1 


1 7323 1 


9424 




1 


i 


33.3 


s 


1 


1 


5415 


1 54151 


1 


1 921 2I 




5710 
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Top Hit Descriptor 


Mus musculus neuromedin U precursor (Nmu) gene, partial cds; tPhLP (TphIp) gene, partial cds; CLOCK 
(Clock) gene, complete cds; PFT27 (Pft27) gene, complete cxIs; and H5AR (H5ar) gene, complete cds 


1 Mus musculus advillin (Advil-pending), mRNA | 




Mus musculus cytociirome P450, 2b9, phenobarbitol inducible, type a (Cyp2b9), mRNA | 


i 

5 

o 
i 


IMUSCARINIC ACETYLCHOLINE RECEPTOR M2 | 


z 

1 

1 

E 
X 




Arabidopsis thaliana DNA chromosome 4, contig fragment No. 88 | 


f 
1 

! 

i 
1 

£ 

i 
1 


Homo sapiens chromosome 21 segment HS21 CI 01 | 


Inomo sapiens FRA3B common fragile region, diadenosine triphosphate hydrolase (FHIT) gene, exon 5 ] 


1 C14203 Clontech human aorta polyA+ mRNA (#6572) Homo sapiens cDNA clone GEN-037E1 1 5' | 


Homo sapiens hypothetical protein FLJ10793 (FLJ10793), mRNA 


Synechocystis sp. PCC6803 complete genome, 9/27, 1 056467-1 1 88885 | 




tn14b09.x1 NCLCGAP_Bm25 Homo sapiens cDNA done 1MAGE:2167577 3' similar to contains Alu 1 


[Homo sapiens mRNA for KIAA0534 protein, partial cds | 


1 Malva pusilla actin (Acti) mRNA, complete cds | 


Homo sapiens chromosome 21 segment HS21C045 | 


|601573026F1 NIH_MGC_9 Homo sapiens cDNA clone IMAGE:3834174 5' | 


iMusmuscuIuscomplementcomponentI inhibitor(C1nh),mRNA | 


1 tal 3h01 .XI NCI CGAP Lym5 Homo sapiens cDNA clone IMAGE:2043985 3' | 


1 Borrelia burgdorferi (section 52 of 70) of the complete genome | 


1 Homo sapiens HT01 7 mRNA. complete cds | 


[ Homo sapiens growth arrest-specinc 7 (GAS7). transcript variant b, mRNA | 


1 Mus musculus antigen (CD72) gene | 


1 Mus musculus antigen (CD72) gene | 


Top Hit 
Database 
Source 


z 








ISWISSPROT 1 


ISWISSPROT 1 


z 


z 


^ 


z 






{EST HUMAN | 








EST HUMAN 












|EST HUMAN | 


z 






z 


z 


Top Hit Acession 
No. 


AF1 46793.2 


i 


1 68577521 


1 


IP30372 1 


IP30372 1 


i 


1X86347.1 1 


|AL1 61 592.2 | 


IAB0207021 1 


|AL163301.2 1 


|aF020503.1 1 


|C14203.1 1 




P90907.1 


|aE000823.1 1 


1 


IAB011106.1 1 


|AF1 12538.1 1 


lALI 63246.2 1 


BE747503.1 


1 67532171 


IA1472641.1 1 


lAEOOl 166.1 1 


|AF225421.1 | 


1 53602111 


IL35772.1 1 


|L35772.1 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 




7.6E-01 


7.6E-01 


7.6E-01 


7.6E-01 


7.6E-01 


7.6E-01 


S 


7.6E-01 


7.6E-01 


s 


7.5E-01 


7.5E-01 


S L 


7.5E-01 


S 


7.4E-01 


S 


7.4E-01 


7.4E-01 


7.4E-01 


7.4E-01 


7,4E-01 


1 7.3E-01 


S 


7.3E-01 


7.3E-01 


7.3E-01 


§ 

1* 


























d 


I 13.921 






























ORFSEQ 
ID NO: 












274151 


1 










20315I 


2304O| 




1 


2519o| 


§ 


220761 




i 

S3 










1 




1 


1 


Exon 
SEQID 
NO: 




i 


i 












18756 




1 


1 


13236 








g 




1 


14111 






i 




































































Probe 
SEQID 
NO: 


i 


1 


1 


7184 


7347 


1 7347] 






1 


1 90731 


s 


i 


3315 


1 45731 


^1 




1114 


1 22951 


8 


4213 


7258 


1 9041 


1 9150 




4602 


5127 


i 


i 
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Top Hit Descriptor 


1 V.alginolyticus sucrase (scrB) gene, complete cds 1 


1 v.alginolyticus sucrase (sorB) gene, complete cds | 


1 
1 

< 

i 

i 

1 
! 


Izi25b08.s1 Soares_fetalJiver_spleen_1NFLS_S1 Homo sapiens cDNA clone IMAGE:431799 3' 1 


1 Rattus norvegieus initiation faotor-2 kinase (elF-2a) mRNA, complete cds | 


N.tabacum NelF-4A13 mRNA | 


1 Gallus gallus gene for melanocortin 2-receptor, complete cds 1 


j Fowlpox virus, complete genome | 




1 602035589F1 NCI CGAP Brn64 Homo sapiens cDNA clone IMAGE:41 83222 5' | 




Homo sapiens transcription factor IGHM enhancer 3, JM11 protein, JM4 protein, JM5 protein, T54 protein, 
complete cds; and L-type calcium channel a> 


Homo sapiens transcription factor IGHM enhancer 3, JM1 1 protein, JM4 protein, JM5 protein, T54 protein, 
complete cds; and L-type calcium channel a> 


Oryctolagus cuniculus RING-frnger binding protein mRNA. partial cds | 


16021 18381 F1 NIH_MGC_56 Homo sapiens cDNA clone IMAGE:4275381 5' 1 


Rattus norvegieus cytocentrin mRNA, complete cds 1 


Aeropyrum pernix genomic DNA, section 6/7 | 


lisoform(RyRI), complete cds | 


Homo sapiens partial TCF-4 gene for T-cell transcription factor-4, exons 15-16 | 


1 Mus musculus otogelin (Otog), mRNA | 


Mus musculus otogeiin (Otog), mRNA ) 


6021 55438F1 NIH MGC 83 Homo sapiens cDNA clone IMAGE:4296344 S \ 


16021 55438F1 NIH MGC 83 Homo sapiens oDNA clone IMAGE:4296344 5' | 




601496330F1 NIH MGC 70 Homo sapiens cDNA clone IMAGE:3898495 5' | 




1 Homo sapiens mRNA for KIAA0614 protein, partial cds | 


Homo sapiens mRNA for K1AA0614 protein, partial cds ] 


1 Top Hit 1 


Database 
Source 






[EST HUMAN | 


[EST HUMAN 1 




z 








1 
























5 


|EST HUMAN ] 




EST HUMAN ] 


;EST_HUMAN | 






Top HitAcession 
No. 


IM26511.1 1 


IM26511.1 1 


§ 
1 


IAA678019.1 1 


IL29281.1 1 


i 

i 


< 


|AF1 981 00.1 1 


i 
1 


IBF338350.1 | 


[090314.1 1 


1 
< 


AF196779.1 


|AF236061.1 1 


S 

8 
£ 


i 


i 
1 


a 


IAJ270777.1 1 


I 73053601 


7305360] 


IBF681 034.1 ] 


]BF581034.1 ] 


1 

3 


1 


]AA421 492.1 ] 


i 


i 


Most Similar 1 


IP 


1 7.3E-01 1 


1 7.3E-01I 


s 


1 7.3E-01 1 


1 7.2E-01| 


s 


7.2E-01 1 


1 7.2E-01I 


7.2E-0l| 


1 7.2E-01 1 


t 7.2E-O1I 


7.2E-01 




7.2E-01 1 


7.2E-01I 




7.2E-01 1 




S 


s 




S 


S 


s 


s 


s 


7.0E-01 ] 


7.0E-01 ] 






















Signal 










s 


1 














































ORF SEQ 
ID NO: 




1 264811 


e 


1 28871 1 




1 216871 


1 221861 


1 22748| 


231 24I 


1 234941 


[ 24335| 


24706 






i 


1 282561 






1 


s 


1 23794] 




1 256261 


] 26162] 






1 


] 20967] 


1 


SEQ ID 
NO: 


1 


1 16315 




1 


1 -107421 


1 




S 


13323 


1 13707| 


1 


14934 




1 16801] 


1 177721 




i 


1 




1 14014] 


1 






] 16022] 




1 19550] 






1 Probe 


SEQ ID 
NO: 


i 


1 6454, 


87291 


8729 




1914 




1 3028 1 


1 3406^ 


1 


1 


5064 


s 
s 


i 


7922 








1 30241 


4114 


4114 


i 


5623 




i 




1209 


1 
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wo 01/57274 
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Top Hit Descriptor 






zx12g12.s1 Soares_total_fetus_Nb2HF8_9w Homo sapiens cDNA done 1MAGE:786310 3' similar to 1 
contains element TAR1 repetitive element ; | 


jaltemat vely spliced; and transcription factor (Relish) gene, complete cds, alternatively spliced 


iMus musculus Wisl^ott-Aldrich syndrome protein (Wasp), mRNA 1 


^.tuberosum mRNA for glucose-6-phosphate dehydrogenase 1 


|xa95g12,x1 NCi_CGAP_Co17 Homo sapiens cDNA done IMAGE:2574598 3' 1 




Helicobacter pyiori, strain J99 section 4/ of 132 of thecompletegenome ■ | 




Gallid herpesvirus 2, complete genome | 




Helicobacter pylori, strain J99 section 47 of 132 of the complete genome | 


1 

9 
g 

■| 
1 

i 
1 

s 
0 


N-ACETYLGLUCQSAMINYL-PHOSPHATIDYLINOSITOL BIOSYNTHETIC PROTEIN GPI1 1 


Homo sapiens SLITI protein (SLIL2) mRNA, partial cds , ) 




(TM) and short cytoplasmic domain, (semaphorin) 5A (SEMA5A) mRNA | 


^albicans random DNA marker, 282bp | 


(HLA-H) gene. RoRet gene, and sodium phosphate transporter (NPT3) gene, complete cds 


Mus musculus kinesin light chain 2 (Klc2), mRNA | 


AV550506 GLC Homo sapiens cDNA clone GLCGID04 3' | 


Homo sapiens chromosome 21 segment HS21 C078 | 




H.vulga-isNa,K-ATPase alpha subunit mRNA, complete cds | 


H .vulgais Na.K-ATPase alpha subunit mRNA, complete cds | 


. Top Hit 
Database 
Source 


u 




EST HUMAN | 


INT 


iNT 


NT 1 


iEST HUMAN | 


NT 1 




NT 


NT 1 


NT 1 


NT 


EST HUMAN | 


SWISSPROT 1 


NT 1 


NT 1 


NT 1 


z 


z 


z 


EST HUMAN | 


NT 1 


NT 1 


NT 1 


NT 1 


top Hit Aoession 
No. 


|aF21 3884.1 1 


i 


|aA451864.1 | 


AF186073.1 


1 

% 


1X74421.1 1 


|AW079110.1 1 


]AJ252942.1 | 


lAEOOl 486.1 1 


i 
i 


1 

i 




]AE001486.1 1 


i 

1 


]014357 1 


AF075240.1 ( 


]AF199339.1 | 


i 


i 


U91328.1 


1 


S 

1 


AL1 63278.2 | 


AE004382.1 | 


i 
1 


M75140.1 1 


MostSimiiar 
(Top) Hit 
BLAST E 
Value 


s 


6.7E-01 


6.7E-01 


6.7E-01 


6.7E-01 1 




6.7E-0l| 


6.7E-01 1 


6.7E-01 1 


6.7E-01 1 


6.7E-01 1 


6.7E-01 1 


6.7E-01 1 


6.7E-01 1 


6.7E-01 1 


6.6E-01 1 


6.6E-01 1 


6.6E-01I 


6.6E-011 


6.6E-01 


6.6E-01I 


6.6E-01I 


6.6E-OII 


6.6E-01 1 


6.5E-01 1 


6.5E-01 1 


Expression 
Signal 


1 29.34I 


1 I9.53I 






















% 


2.231 


3.69 


0.87| 














2.41) 






1.37| 


ORF SEQ 
ID NO: 






21887 






24032] 


24538] 


24811 


§ 




1 




263671 




281471 


222271 


1 


23168| 


i 




i 


i 




S 




20353] 


Exon 
SEQ ID 
NO: 


i 


s 


1 




12893| 




14762| 


i 




15723| 


15723| 




1 


i 


179031 


12329| 


12532| 


13362] 






15728| 




17422| 




10544| 


1 


Probe 
SEQ ID 
NO: 






... 


2120 


1 


1 43501 


4881 


5182 


5627 


5817 


5817 




6342 


8323 


8754 


2452 


1 


1 3445 1 




4020 




s 


7571 


i 


1 


1 
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protein, partial cds | 


1 

i 






i 

' i 
i 






i cytochrome b | 
















































1 


d GDP-r 










nitl.anc 






































1 
1 

1 


[Mus musculus gene for Tob2, complete cds 


iHomo sapiens interleukin 10 receptor, alpha (IL1 ORA) mRNA 






1 Phaseolus vulgaris ATPase gamma subunit mRNA, nuclear gene encoding mitoct 


Acetobacter xylinum putative ATP binding protein delta-AceB gene, partial cds, an( 




Mus musculus small GTP-binding protein RAB25 (Rab25) gene, complete cds 


1 
< 

i 

1 

i 

£ 

1 
i 

1 

§ 

1 

1 
I 


|no15c07.s1 NGI_CGAP_Phe1 1 lomo sapiens cDNA clone IMAGE:1 100748 3' 


|AU138078 PLACE1 Homo sapiens cDNA clone PLACE1007810 5' 




|hv74a10.x1 NCI_CGAP_„Lu24 Homo sapiens cDNA clone IMAGE;3179130 3' 


[S.cerevisiae chromosome IV reading frame ORF YDL097c 


1 Drosophila meianogaster 8kd dynein light chain mRNA, complete cds 




1 Homo sapiens mRNA for KIAA1607 protein, partial cds 


(M.musouluswhn gene 


1 
i 

1 


I Treponema pallidum section 63 of 87 of the complete genome 


Homo sapiens ataxia telangiectasia (ATM) gene, complete cds 


[602150289F1 NIH_MGC_81 Homo sapiens cDNA clone IMAGE:4291 1 26 5' 


IAV759212 MDS Homo sapiens cDNA done MDSCGC09 5' 




! HISTIDINE-RICH PROTEIN PRECURSOR (CLONE PFHRP-lll) 




IShigellaflexneri multi-antibiotic resistance locus 




Gallus gallus bone morphogenetic protein 1 (BMP1) mRNA, partial cds 


Lycopersicon esculentum p59a gene, complete CDS 


III 


















lUMAN 


lUMAN 1 


lUMAN 1 




lUMAN 1 


















lUMAN 1 


lUMAN 1 




5PR0T 1 














NT 


NT 


NT 








NT 




EST_H 




EST t- 




EST 1- 


NT 


z 


NT 


NT 


NT 


NT 


NT 


NT 




EST f- 


NT 


3SIMS 


NT 


NT 


IN 


NT 




TopHitAcession 
No. 


IAB041225.1 1 


1 45046321 


|AJ272265.1 | 


|AL1 61 539.2 | 


i 


|U37258.1 1 


|D8834ai 1 


|AF1 19676.1 1 


i 


IAA601287.1 1 


lAU 138078.1 1 


|aF014115.1 1 




IZ74145.1 1 


|U48848.1 1 


|U48854.2 1 


IAB046827.1 | 


lYI 2488.1 1 


IY12488.1 1 


IAE001247.1 1 


|U82828.1 1 


1 


IAV759212.1 1 


1 

i 


IP05228 1 


|U32689,1 1 


|U81 135.1 1 


IU75331.1 1 


|U75331.1 1 


? 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 6.5E-01 


1 e.5E-01 


1 6.5E-01 


S 




1 6.5E-01 


1 6.5E-01 


1 6.5E-01 


1 6.5E-01 


1 6.5E-01 


1 6.5E-01 


1 6.5E-01 


1 6.5E-01 


1 6.5E-01 


S 


1 6.4E-01 


1 6.4E-01 


1 6.4E-01 


1 6,4E-01 


1 6.4E-01 


1 6.4E-01 


S 


1 6.4E-01 


9 


1 6.3E-01 


1 6.3E-01 


1 6.3E-01 


1 6.3E-01 


1 6.3E-01 


2 


Expression 
Signal 






1 4.1 7| 


1 












1 


1 4.27] 
















0.84| 














1 2.151 




i 






ORFSEQ 
ID NO: 


1 23104] 


1 236331 










260711 


28008 


1 


28215 










200351 






240661 


1 


1 27076] 


1 










202771 


219031 


S 


1 223011 






13304 






14113 




15041 


15939 




17922 


1 




1 




1 






i 


14285 


14285 


1 




1 




s 


10372 


1 


12005 


12408 




































































Probe 
SEQ ID 
NO; 




1 


4185 


4215 


i 


1 51751 




7919 


i 


8073 


1 817C| 




i 


i 






3785 


9 


g 

9 


1 


1 78071 


7818 


i 


i 






1 21161 


2534 
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Top Hit Descriptor 


1 

i 
E 

■§ 

1 
1 

d 


601S76889F1 NIH MGO 21 Homo sapiens cDNA clone IMAGE:3959351 5' | 


glycoprotein ilia{Alu 1 and 3 fusion junction} [human, Genomic Mutant, 300 nt] | 


1 
1 


Variola Virus, complete genome ( 


Escherichia coli K-1 2 MG1 655 section 203 of 400 of ttie complete genome | 


nr09h06.s1 NCI CGAP ColO Homo sapiens cDNA clone IMAGE:1161371 3' similar to TR:O0291 6 0029161 
HLARK. ; | 




HYPOTHETICAL 13.7 KD PROTEIN IN iN01-IDS2 INTERGENIC REGION 1 


HYPOTHETICAL 15.3 KD PROTEIN IN VMA12-APN1 INTERGENIC REGION 


Mus musculus keratin complex 2, gene 6g (Krt2-6g), mRNA | 




C.limicola pscD gene | 


HYPOTHETICAL 142.5 KD PROTEIN C23E2.02 IN CHROMOSOME I | 




ys01e08.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:213542 3' | 








NON-STRUCTURAL POLYPROTEIN [CONTAINS: RNA-DIRECTED RNA POLYMERASE ; THIOL 1 
PROTEASE P3C ; HELICASE (2C LIKE PROTEIN); COAT PROTEIN] | 


NON-STRUCTURAL POLYPROTEIN [CONTAINS; RNA-DIRECTED RNA POLYMERASE ; THIOL 1 
PROTEASE P3C ; HELICASE (20 LIKE PROTEIN); COAT PROTEIN] | 


Mus musculus secreted acidic cysteine rich glycoprotein (Sparc), mRNA 1 




Rat TRPM-2 gene, complete cds 


Rat TRPM-2 gene, complete cds | 


Arabidopsis thaliana putative zinc transporter (ZIP1 ) mRNA, complete cds | 


Homo sapiens mitogen-activated protein kinase kinase kinase kinase 4 (MAP4K4), mRNA | 


Homo sapiens mitogen-activated protein kinase kinase kinase kinase 4 (MAP4K4), mRNA 1 


Homo sapiens G-protein coupled receptor EDG-7 mRNA, complete cds | 


Homo sapiens G-protein coupled receptor EDG-7 mRNA, complete cds | 


Top Hit 
Database 
Source 


z 


1 
5 

5 


NT. 


NT 1 


NT . 


NT 1 


z 


EST HUMAN 1 




SWISSPROT 1 




z 


NT 1 


SWISSPROT 1 


NT 


EST HUMAN | 


EST HUMAN | 


NT 


z 


SWISSPROT 1 


SWISSPROT 




NT 1 


NT 


NT 1 


NT 1 




NT 


NT 


1 


TopHitAcession 
No. 


1X99675.1 1 


IBE902044.1 | 


IS62927.1 1 


9627521 1 


96275211 


AE000313.1 1 


|aA877715.1 | 


i 

1 


IP47003 1 


P36073 1 


991 0293 1 


1 

< 


i 


Q10135 1 


AF022253.1 | 


H72255.1 1 




M24461.1 1 


AL161511.2 1 


|p27410 1 


|p27410 1 


i 




i 

s 


M64733.1 1 


AF033535.1 | 


§ 


114310651 


AF236117.1 1 


1 


Most Similar 
(Top) Hit 
BLAST E 

Value 


s 


6.3E-01 


6.3E-01 


6.3E-01 


6.3E-01 


6.3E-01 


6.3E-01 


6.3E-01 


i 


6,3E-01 


6.3E-01 


6.3E-01 


6.3E-01 


6.2E-01 


6,2E-01 


6.2E-01 


6.2E-01 


6.2E-01 


6.2E-01 


6.2E-01 


6.2E-01 


3 L 


E-01 


S 


E-01 


E-01 


s 


E-01 


E-01 


3 




















i a 

1* 






» 




2.72 




1 2.27 1 








9.21 1 




1.58| 








1.7li 


2 
oi 




5.O2I 






1 i 






3.72I 




1.75 


19.47| 




ORF SEQ 
ID NO; 






272231 




274981 


S 


28543 1 


28791 


1 




1 






i 










i 


i 


28100 




24578 


i 


26135| 






271761 


274921 


1 


Exon 
SEQ ID 
NO: 






7027 




§ 








i 








i 




i 


6710 


6448 


7336 


1 


17867 


17857 


1 




i 


1 


6671 




S 


































































Probe 
SEQID 
NO: 


1 39851 


1 


71601 


g 


74211 




1 8413| 


i 


8725 




91281 




1 


55781 


i 


i 


7435 


74761 


1 


soo 


8007 


ll 




s 

E 


61041 


i 


7108 


7108 


7419 


7419 
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§_ 

ii 



133 



wo 01/57274 PCT/USOl/00666 









































































similar to 
















1 
1 




RNA, partial c( 
























IS element PTF 












i 
















































1 






Top Hit Descriptor 


|601862474F1 NiH_MGC_56 Homo sapiens cDNA clone IMAGE:4076131 5' 




HUM500E06B Human placenta polyA+ (TFujiwara) Homo sapiens cDNA clone GEN 


1 
E 

i 
I 
1 
1 

i 

8" 

< 
1 


|ynS1bQ3.s1 Scares adult brain N2b5HB55Y Homo sapiens cDNA clone llulAGE:175 
|gb:S78187 M-PHASE INDUCER PHOSPHATASE 2 (HUMAN); 


ISPORE COAT PROTEIN SP96 


ISPORE COAT PROTEIN SP96 


|Homo sapiens partial TCF-4 genefor T-cell transcription factor-4, exons 6-11 


|Homo sapiens partial 5-HT4 receptor gene, exons 2 to 5 


i 
1 

1 

1 

I 

E 

i 
§ 


|602127577F1 NiH_MGC_56 Homo sapiens cDNA done IMAGE:4284403 5' 


IPUTATIVE TRANSCRIPTION FACTOR OVO-LIKE 1 (M0V01) (M0V01A) 




|Homo sapiens T cell receptor beta chain (BV6S7*2-BJ1S1) mRNA, partial cds 




|6C1454962F1 NIH_MGC_56 Homo sapiens cDNA clone IMAGE:3858590 5' 


iBotrytis oinerea strain T4 cDNA library under conditions of nitrogen deprivation 


|PYRR0LINE-5-CARB0XYU\TE REDUCTASE (P5CR) (P5C REDUCTASE) 


IMR3-HT0736-1 80700-003-302 HT0736 Homo sapiens cDNA 


Homo sapiens mRNA for KIAA0740 protein, partial cds 


1 
1 

i 

1 

i 


Arabidopsis thaliana DNA chromosome 4, contig fragment No. 13 


1 Chicken TBP gene, exon8, complete cds 


IAV684703 GKC Homo sapiens cDNA clone GKCFSF05 5' 


AV684703 GKC Homo sapiens cDNA done GKCFSF05 5' 


[601614007F1 N1H_MGC_71 Homo sapiens cDNA clone IMAGE:391 5457 5' 


hg75g10.s1 NCLCGAP_Pr6 Homo sapiens cDNA done IMAGE;940674 similar toe 
repstitive element; 


|HIGH AFFINITY POTASSIUM TRANSPORTER 


I602132029F1 NIH MGC 81 Homo sapiens cDNA clone IMAGE:4271 334 5' 


s| 8 


1 








lUMAN 1 


JPROTH 


5PROT 1 








1 


SPROT 1 








Z 

1 




3PR0T 1 


Z 










lUMAN 1 


1 


lUMAN 


1 Nwani 


o 
a: 


1 










Z 




i 


W 
W 


z 


Z 




X 

o5 


w 


z 


z 


Z 


X 


z 


i 


X 


z 




Z 






X 




i 


1 

« 


i 


TopHitAcession 
No. 


i 


|AB009077.1 1 


078659.1 


5 
g 


|h41571.1 1 




IP14328 1 


IAJ270774.1 1 


|AJ243213.1 1 


1 


|BF700092.1 1 


o 


|aB033503.1 1 


|AF01 1581.1 1 


U78517.1 


|BF035413.1 J 


IAL111440.1 1 


8 


|BE71 5051.1 1 


i 
1 


|AB018283.Z | 


|AL161501.2 J 


ID83135.1 1 


IAV684703.1 | 


i 


i 


|aA493535.1 | 


|P50505 1 


IBF573829.1 | 


Similar 

STE 
ilue 




9 = 
S ° 


?S 






i 




i 




s 


i 


i 








S" 


i 




i 


i 


5.6E-01 


i 


i 


S 


i 


s 




5.6E-01 


i 






















































§ 

i 1 
r 






































§. 






i 


o 












o 


ORF SEQ 
ID NO: 


1 23596 1 


1 24090, 


25768 






i 


1 


1 27224 1 








1 2291 5 1 




1 23533] 


% 


i 


1 248741 


1 






1 23044| 


i 




1 271 80 1 


1 27181 1 




i 










g ! 


?i 


1 


s 
s 


1 






i 


i 










1 






i 






% 


i 


i 






s 


1 


S 


i 






























































Probe 
SEQ iD 
NO: 




11 


II 


8 


i 


1 


i 




s 


i 


i 






1 


i 


i 


g 


i 
















i 




i 
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Top Hit Descriptor 


1 Mycoplasma genitallum section 9 of 51 of tlie complete genome | 


zu42h1 2.y5 Scares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:74071 1 5' | 


1 zu42h1 2.y5 Scares ovary tumor NbHOT Homo sapiens oDNA clone IMAC3E:74071 1 5' | 


|7e73=12.x1 NCI CGAP Pr28 Homo sapiens cDNA clone IMAGE;3288118 3' similar to gb;J02783 1 
IpROTEIN DISULFIDE ISOMERASE PRECURSOR (HUMAN); | 


I7e73c12.x1 NCI CGAP Pr28 Homo sapiens cDNA clone IMAGE:328S1 18 3' similar to gb:J02783 1 
IpROTEIN disulfide ISOMERASE PRECURSOR (HUMAN); | 


chlorop ast product 


|601339867F1 NIH_MGC_53 Homo sapiens cDNA clone IMAGE:3682168 5' | 


□g30e05.s1 NCI_CGAP_Br7 Homo sapiens cDNA clone IMAGE:1441376 3' similar to gb:J02611 1 
APOLIPOPROTEIN D PRECURSOR (HUMAN); 


1 Drosophila melanogaster helix-loop-helix mRNA, complete cds 1 


NUCLEAR FACTOR OF ACTIVATED T CELLS 5 (T CELL TRANSCRIPTION FACTOR NFAT5) (NF-AT5)| 
(REL DOMAIN-CONTAINING TRANSCRIPTION FACTOR NFAT5) 


Homo sapiens phospholipid scramblase 1 gene, complete cds | 


1 Homo sapiens chromosome 21 segment HS21 C085 1 


Homo sapiens mRNA for KIAA0740 protein, partial cds | 


1 Chlamvdophila abortus strain S2&3 POMP91 A and POMP90A precursor, genes, complete cds | 




] Botrytis cinerea strain T4 cDNA library under conditions of nitrogen deprivation | 


[am77g05.s1 Stratagene schizo brain S1 1 Homo sapiens cDNA done IMAGE:1616504 3' | 


Medicago sativa chloroplast malate dehydrogenase precursor (pi mdh) mRNA, nuclear gene encoding 1 


Homo sapiens chromosome Xq28 melanoma antigen family A2a (MAGEA2A), melanoma antigen family A12 
(MAGEA12), melanoma antigen family A2b (MAGEA2B), melanoma antigen family A3 (MAGEA3), caltractin 




[ Homo sapiens PELOTA (PELOTA) gene, complete cds | 




Idelta) 1 


a 

1 
1 
1 

1 

1 

E 

i 


1 Pdyangium ^^tellinum (strain PI vtl ) 1 6S rRNA gene | 


Top Hit 
Database 
Source 


NT 


|EST HUMAN | 


|EST HUMAN | 


[est human 1 


EST HUMAN | 




EST HUMAN 


^. 


iNT 1 


SWISSPROT 


iNT 1 


NT 


iNT 1 


iNT 1 


iNT 1 


iNT 1 


[EST HUMAN | 




NT 




NT 


|EST HUMAN ] 


SWISSPROT 1 


INT 1 




TopHitAcession 
No. 


IU39687.1 1 


|AI820921.1 1 


IAI820921.1 1 


i 

1 


|bE645620.1 | 


|l01950.2 1 


1 

s 


AA91S053.1 


|L20770.1 1 


Q9WV30 


IAF224492.1 | 


|AL1 63285.2 | 


|AB01 8283.2 | 


1U65942.1 1 


]D73443.1 1 


]AL1 16780.1 ] 


§ 
1 


AF020269.1 | 


U82671.2 


] 7106444] 


L^FI 43952.2 1 


[AWI 37066.1 1 


P18516 1 


IM58509.1 1 


jAJ233944.1 ] 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 5.3E-01| 


5.3E-01 1 


1 5.3E-01I 


5.3E-01 1 


5.3E-01 1 


5.3E-0l| 


5.3E-01 1 


s 


5.2E-01 1 


5.2E-01 1 


5.2E-01 1 


1 5.2E-01| 


5.2E-01 1 


1 


5.2E-01 1 


5.2E-01] 


s 


5.2E-01 1 


5.2E-01 


5.2E-01 ] 


5.2E-01 1 


5.2E-01 ] 


5.2E-0l| 


s 


S 


g 

11 












1 2.1 sl 


in 


<M 








1 5.04 1 


















] 1.28] 


% 








ORF SEQ 
ID NO; 






i 








i 




205681 


s 

§ 


20930 




21886] 












23287 




1 27837] 


1 25221] 




20347] 


20381 ] 


Exon 
SEQ ID 
NO: 




152701 


1 15270] 






170381 




195S1 




11061 




11739 


1 




13124] 


13278] 




I3494I 


1 




17611] 


8 




10537] 


1 


Probe 
SEQ ID 
NO: 


1 41171 


1 5350 1 


1 5350 1 


5467 




™ 


1 


9015 




1148 


1174 


1843 


1 


i 






i 


1 


i 


1 


7761 


9719 


i 




3 
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Top Hit Descriptor 




Cavia porcelluE pulmonary surfactant protein A (SP-a) mRNA, complete cds | 


Homo sapiens diacylglycerol kinase 3 (DAGK3) gene, exon 10 1 


Homo sapiens diacylglycerol kinase 3 (DAGK3) gene, exon 10 | 


Oryza saliva subsp. japonica mEF-G mRNA for mitoobondrial elongation factor G, complete cds | 


1 

i 

1 

j 

I 
o 

§ 


Mus musculus unc13 homolog (C. elegans) 1 (Unc13h1 ), mRNA 1 




nq22e11.s1 NCI_CGAP_Co10 Homo sapiens cDNA clone IMAGE:1144652 3' 1 


Homo sapiens potassium channel, subfamily K, member 5 (TASK-2) (KCNK5) mRNA, and translated 1 

prcxducts 1 


Saccharomi^es cereWsiae) sporulatlon protein (SP011) gene required for melotic recombination, complete 1 


nu85fC9.s1 NCI CGAP AM Homo sapiens oDNA clone IMAGE:1217513 


Homo sapiens reproduction 8 (D8S2298E) mRNA 1 


ArabidopsiE thaliana DNA chromosome 4, conlig fragment No. 4 | 


Arabldopsis thaliana DNA chromosome 4, contig fragment No. 4 | 


Yj77f1 0.yS Scares breast 2NbHBst Homo sapiens cDNA clone IMAGE:1 54795 5' similar to contains element 1 
|mER6 repetitive element; | 


i 
1 

1 

0 
1 

1 


Trypanosoma cruzi transposon VIP II SIRE repeat region | 


601 584324F1 NIH_MGC_7 Homo sapiens cDNA clone IMAGE:3938909 5' | 


601 883880F1 NIH WGC 57 Homo sapiens cDNA clone IMAGE:4095387 5' 


Influenza A virus isolate hk51 697 hemagglutinin (HA) gene, partial cds 1 


Human collagen alpha2(XI) (COL11A2) gene, exons 6 through 16, and partial cds | 


RC6-NT0029-240400-01 1 -E08 NT0029 Homo sapiens cDNA | 


hdl 1C08.X1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone IMAGE:2909198 3' | 


RC1 -ST0278-040400-C1 8-b06 ST0278 Homo sapiens cDNA | 


602081 103F1 NIH_MGC_81 Homo sapiens cDNA clone 1MAGE:4245481 5' | 


602081 1 03F1 NIH_MGC_81 Homo sapiens cDNA clone IMAGE:4245481 5' | 


Bovine steroid 21 -hydnjxylase gene (P-450-C21 ) gene, complete cds | 


INTERFERON REGULATORY FACTOR 3 (IRF-3) | 


INTERFERON REGULATORY FACTOR 3 (IRF-3) | 


601 568755F1 NIH_MGC_21 Homo sapiens cDNA clone 1MAGE:3843637 5' | 


Top Hit 
Database 
Source 












EST HUMAN 1 






EST HUMAN 1 






EST HUMAN 








EST HUMAN | 






EST HUMAN | 


EST HUMAN 






EST HUMAN | 


i 


EST HUMAN | 


EST HUMAN 1 


EST HUMAN 1 




SWISSPROT 1 


SWISSPROT 1 


EST HUMAN | 


Top Hit Acession 


i 


IBF571462.1 | 






IAF020931.1 1 


IAB040051.1 1 


1BF209791.1 1 




|AF1 7691 2.1 1 


i 
1 






1 
1 


1 50316501 


|AL1 61 492.2 | 


IAL161492.2 | 


5 

1 
< 


IX835C2.1 1 


1 


IBE790632.1 | 


!BF217173.1 




IU41069.1 1 


1 


i 


1 


i 


1 


[U11267.1 1 


IQ90643 1 


S 
§ 
a 


|BE734781.1 | 


Most Similar 
(Top) Hit 


BLAST E 
Value 


4.9E-01 


4.9E-01 




S 


4.9E-01 




4.9E-01 


9 


4.9E-01 


4.8E-01 


4.8E-01 


4.8E-01 


4.8E-01 


4.8E-01 


9 


4.8E-01 


4.8E-01 


4.8E-01 


4.8E-01 


4.7E-01 


4.7E-01 


4.7E-01 


4.7E-01 


4.7E-01 


9 


9 




s 


4.6E-01 


4.6E-01 


4.6E-01 


'signal 






















<d 


i 


























Em 














ORF SEQ 


d 
z 


20541 1 


1 






264291 












251 53I 








1 


1 








25953 




285641 


1 






233831 






25111| 


251121 


25372| 


Exon 
SEQ ID 
NO: 








1 
























1 




























5280 


5323 
































































Probe 
SEQ ID 
NO: 




1 


i 


56791 


1 


7197 


7359 


S 


i 


1 4577I 


i 


s 


i 




1 


6613 


8111 


3 






82211 


8434 


i 


1 


3674 




i 


i 


8 


§ 


54041 
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E 



Top Hit Descriptor 


501 S57225R1 NIH MGC 67 Homo sapiens cDNA clone IMAGE:3866023 3' | 


QV2-PT0012-140100-031-C09PT0012 Homo sapiens cDNA - | 


COAT PROTEIN 


Wl32e02.x1 NCi_CGAP_Ut1 Homo sapiens cDNA clone IMAGE:242661 8 3' similar to TR:Q92923 Q92923 1 
SWI/SNF COMPLEX 170 KDA SUBUNIT. ; | 


1 

i 

1 

■1 

1 
I 

8, 

1 


Homo sapiens hvpottieticai protein DKFZp547G1 83 (DKFZp547G1 83). mRNA | 


EST02531 Fetal brain, Stratagene (cat#936206) Homo sapiens cDNA clone HFBCY1 7 \ 


EST02531 Fetal brain. Stratagene (cat#936206) Homo sapiens cDNA cione HFBCY1 7 | 


xol 4h01 .x1 NCI CGAP Ut3 Homo sapiens cDNA clone IMAGE:2703985 3' similar to SW:iNT6_MOUSE 1 
Q64252 VIRAL INTEGRATION SITE PROTEIN INT-6. [1] ; 


|6ai449201F1 NIH MGC 55 Homo sapiens cDNA cione IMAGE:3852961 5' | 


I602035275F1 NCI CGAP Brn64 Homo sapiens cDNA clone INMGE:4183290 5' | 


Homo sapiens testis-specific kinase 2 (TESK2), mRNA 1 


Entamoeba histolytica diaphanous protein (dia) gene, partial cds | 


1 Mus musculus integral membrane-associated protein 1 (Itmapl ), mRNA | 


[VASCULAR ENDOTHELIAL GROWTH FACTOR B PRECURSOR (VEGF-B) (VEGF RELATED 1 

FACTOR) 1 


Rattus norveglcus SynGAP-b mRNA, complete cds | 


iRattus novegicus SynGAP-b mRNA, complete cds | 


|7]91d02.y1 NCI_CGAP_Br16 Homo sapiens cDNA clone IMAGE:3393795 5' 


|601237139F1 NIH_MGC_44 Homo sapiens cDNA clone IMAGE:3609393 5' 1 


1 MR0-HT0078-1 31 299-007-g05 HT0078 Homo sapiens cDNA | 


IHISTIDINE-RICH GLYCOPROTEIN PRECURSOR 


\ 

1 

E 

1 

1 

g; 

r. 

3 - 

D 

' 

z 

o " 

O " 

i 

z ■ 

\ ; 


1 AV720408 GLC Homo sapiens cDNA done GLCCSC1 2 5' | 


|qi62h1 1 .XI NCI_CGAP_Brn25 Homo sapiens cDNA done IMAGE:1 861 1 25 3' simiiar to TR:Q291 68 Q291 681 
luNKNOWN PROTEIN ; 


Iqi62h11.x1 NCI CGAP Brn25 Homo sapiens cDNA clone iMAGE:1861125 3' similar to TR:Q29168 0291681 
[UNKNOWN PROTEIN ; | 


|xc27eOS.x1 NCLCGAP_Co18 Homo sapiens cDNA clone IMAGE:2585510 3' similar toTR:095154 0951541 
IaFLATOXIN B1-ALDEHYDE REDUCTASE. ; | 


IS.tuberosum mRNA for induced stolon tip protein (partial) | 


Top Hit 


DataBase 
Source 


IeST HUMAN 1 


lEST HUMAN 1 


IsWlSSPROT 1 


EST HUMAN 1 


EST HUMAN 




|EST HUMAN | 


lEST HUMAN 1 


EST HUMAN | 


lEST HUMAN 1 


|EST HUMAN | 




z 


z 


IsWlSSPROT 1 


z 




|EST HUMAN | 


[EST HUMAN | 


lEST HUMAN 1 


ISWISSPROT 1 


ISWISSPROT 


IeST HUMAN 1 


|eST HUMAN 


1 
I 


EST HUMANj 






i 


BE983445.2 1 




Q00956 1 


2 
< 


i 
< 


114447861 




i 


AW591271.1 1 


BE871461.1 1 


BF337531.1 1 




AF238234.2 1 


s 

i 




1 
< 


AF058790.1 


i 

1 


IBE378707.1 


IBE141396.1 






% 


AI198413.1 


1 
< 


1 

< 


|Z1 1679.1 


EI 
11 


BLAST E 
Value 


i 


S 


4.5E-01 1 


S 


4.5E-01 


i 




s 




9 


s 


9 


9 


9 




9 


9 


9 


4.4E-01 


$ 


4.4E-01 1 


9 


4.4E-01 


9 


9 








Signal 
















25.09 






































1 10.84| 


C 

c 


ID NO: 




25375 












1 


1 








1 






i 




1 








25051 


11 




25628 


1 






Si 






1 


































1 


% 




\ a 




i 


1 


1 


1 Probe 1 


SEQ ID 
NO: 


1 




1 


1 


1 




i 




i 


i 


i 




1 


1 












1 




i 


11 


i 


i 




% 
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Top Hit Descriptor 




i 

§ 
O 

1 

z 

! 

X 

!§i 

o 
z 


GLYCOPROTEIN B PRECURSOR (GLYCOPROTEIN 14) | 


-YROSINE-PROTEIN KINASE RECEPTOR TIE-1 PRECURSOR | 




bata -HKA=H,K-ATPase beta-subunit [rats, Genomic, 8983 nt, segment 2 of 2] | 


MusmysculusEodiumchannel, type X.alptia polypeptide (Scnl Oa), mRNA , | 






UV EXCISION REPAIR PROTEIN PROTEIN RAD23 HOUOLOG A (HHR23A) | 


Callithrix jacchus MW/LW opsin gene, upstream flanking region | 


Callithrix jacchus MW/LW opsin gene, upstream flanking region 1 


QV4-SN0024-200400-183-bC1 SN0024 Homo sapiens cDNA | 


CM2-DT0003-01 0200-077 cOI DT0003 Homo sapiens cDNA | 


MR0-BN0070-270300-008-g04 BN0070 Homo sapiens cDNA | 


Aquifex aeolicus section 30 of 109 of the complete genome | 


Human somatostatin 1 gene and flanks | 


Callithrix jacchus MW/LW opsin gene, upstream flanking region 1 


ICallNhrixjacohus MW/LW opsin gene, upstream flanking region | 


Arabidopsis thaliana DNA chromosome 4, contig fragment No. 14 . | 


Xestia c-nlgrum granulovirus, complete genome | 


Saimiri sciureus olfactory receptor (SSC1 8B) gene, partial cds 1 


1 
O 

8 

1 

•g 

1 


DNA GYRASE SUBUNIT B | 


6020231 34F1 NCI_CGAP_BmB7 Homo sapiens cDNA done IMAGE:41 58296 5' | 




i 

1 
i 

a 
S 

! 
1 

o 

1 


hh74e1 0.yl NCI GGAP_GU1 Homo sapiens cDNA done 1MAGE:29S8554 5' | 


S 
1 

o 
1 

1 

z 

P 
1 

ll 

li 

"i< 
It 

1% 

^ % 

\t 


i 
. 

: 

1 ^ 
& % 
l\ 

.2 \ 

li 


RC3-BN0034-290200-013-C12 BN0034 Homo sapiens cDNA 


I 

X 

i 
1 

1 


Top Hit 
' Database 
Source 


1 SWISSPROT 1 


|EST HUMAN | 


1 SWISSPROT 1 


[SWISSPROT 1 












jSWlSSPROT 1 






|EST HUMAN | 


|EST HUMAN | 




















ISWISSPROT 1 


lEST HUMAN 1 




lEST HUMAN | 


lEST HUMAN | 


EST HUMAN | 




lEST HUMAN 1 




TopHitAoession 
No. 


i 

o 


i 


IP28922 1 


IP35590 1 


1 


|S76404.1 1 




|AL1 63282.2 | 




1 


|AF1 55218.1 1 




|AW866550.1 | 


IAW935269.1 | 


g 


§ 


IJ00306.1 1 




|AF1 5521 8.1 1 


|AL1 61 502.2 1 


1 95352501 


|AF1 79825.1 | 


IAJ001678.1 1 


1033367 1 


|BF348001.1 1 


i 


IAW630048.1 1 




|aW170559.1 1 


|AF07S629.1 1 


AW993658.1 


|AJ003022.1 1 


Most Similar 
(Top) Hit 
B1.ASTE 
Value 


1 4.4E-01 


1 4.4E-01 


1 4.4E-01 


§ 


9 


9 


9 


1 4.4E-01 






9 


i 


9 
% 


1 4.3E-01 




9 
% 


9 
% 


9 


9 


9 


i 


9 


9 


9 


i 


§ 


9 


9 


i 


9 J 


4.3E-01 




Expression 
Signal 
























s 










































ORFSEQ 
ID NO: 




1 27798| 






1 280251 








25224| 






1 201791 






22742| 




23737| 










i 


1 








276741 




1 


S f 
S % 






Exon 
SEQ ID 
NO: 




17573 


17574 




17785 






1 










1 




12950 


13145 


1 






14755 






1 


1 




16787 


g 






i \ 


M 




































































Probe 
SEQID 
NO: 


1 7354 1 


1 


1 


7813 


1 


1 


1 


1 


i 


i 




i 


1 


1 


3022 


1 




1 


4306 


4875 






1 


1 61021 


i 


1 


1 


1 


.J 


l\ 


n 
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Top Hit Descriptor 


CELL DIVISION PROTEIN FTSH HOMOLOG PRECURSOR | 




Xylella fastidiosa, section 93 of 229 of the complete genome | 


1 

1 
O 

5 

5 
1 


QV0-LT0015-180200-127-h01 LT0015 Homo sapiens cDNA | 


SOX-8 PROTEIN | 


RC5-BT0559-020300-013-E06 BT0559 Homo sapiens cDNA 1 


njS9h01 .si NCLCGAP_Pr10 Homo sapiens cDNA clone IMAGE:997777 similar to gb:M33600 HLA CLASS 1 
II HISTOCOMPATIBILITY ANTIGEN, DR-1 BETA CHAIN (HUMAN); 


i 

u 

a 

f 

z 

s 

1 

1 

1 


601879721F1 NIH MGC 55 Homo sapiens cDNA clone IMAGE:4108403 5' 1 


RC3-CT0254-060400-029-g04 CT0254 Homo sapiens cDNA | 


AU1 58472 PLACE2 Homo sapiens cDNA clone PLACE2000470 3' | 


AU1 58472 PLACE2 Homo sapiens cDNA clone PLACE2000470 3' ] 


Brcal =breast cancer gene [rats, WF, spleen. Genomic, 419 nt, segment 2 of 2] 1 


Arabidopsis thaliana DNA chromosome 4, contig fragment No. 47 | 




EST36941 3 MAGE resequences, MAGE Homo sapiens cDNA ( 


mRNA 1 


f«3-SN0010-280300-103-h07SN0010 Homo sapiens cDNA | 


Oryzias latipes OIGC7 mRNA for membrane guanjijii cvclase, complete cds | 


501660352R1 NIH MGC 71 Homo sapiens cDNA clone IMAGE:3906085 3' | 


AV731 81 5 HTF Homo sapiens cDNA clone HTFBHH05 5' | 


RC-BT091-210199-142BT091 Homo sapiens cDNA 1 


AV705243 ADB Homo sapiens oDNA clone ADBAHF08 5' 


AV705243 ADB Homo sapiens oDNA clone ADBAHF08 5' 


PM-BT103.270499-6B4 BT103 Homo sapiens cDNA 




Arabidopsis thaliana DNA chromosome 4, contig fragment No. 36 | 




Oj94b08,s1 Scares NFL T GBC S1 Homo sapiens cDNA clone IMAGE:1 505943 3' ] 


Rhodococcus sp. AD45 isoG, isoH, Isol, isoJ, isoA, isoB, IsoC, isoD, isoE and isoF genes ( 


om33d02.s1 Scares NFL_T GBC_S1 Homo sapiens cDNA clone IMAGE:1542819 3' | 


Top Hit 
Database 


SWISSPROT 1 


1 

^1 




1 


1 


SWISSPROT 1 


EST HUMAN 1 


I 


i 




EST HUMAN 1 




EST HUMAN 






EST HUfMN 1 


1 




EST HUMAN | 




EST HUMAN | 


EST HUMAN | 


i 


EST HUMAN 1 


1 


EST HUMAN 1 








EST HUfMN 1 




1 


Top Hit Acession 
No. 


Q39102 1 


AA761 653.1 | 


AE003947.1 1 


A1280338.1 1 


AW835527.1 1 


a 


BE073574.1 1 


AA534093.1 


R13467.1 1 


BF242055.1 | 


i 

i 


AU158472.1 1 


AU1 58472.1 1 


S82504.1 1 




AW957448.1 | 


AW957448.1 1 




AW863666.1 | 


AB023489.1 | 




AV731815.1 1 


A1905481.1 1 


AV705243,1 1 


AV705243.1 1 


i 

1 




AL1 61 536.2 | 


1 






AAg09257.1 1 


Most Similar 
(Top) Hit 
BU\STE 
Value 


i 


4.2E-01 


9 


S 


4.2E-01 


9 


S 


4.2E-01 


9 


9 

^ 


3 


9 


9 


§ 


9 


9 


i 


9 


S 


9 


i 


i 


9 


9 


S 


9 


9 


9 




9 


9 


9 






















g 

11 
8-"' 




1 




° 


1 






I 




































8 










S. 




ORF SEQ 
ID NO: 


§ 




232631 




235031 


235951 




24273I 


243501 


25484| 




261641 


261 65| 


262191 


262331 


26739] 


267401 


1 


1 


28528| 








208441 


208451 




224341 


22633 1 


i 


1 






Exon 
SEQ ID 
NO: 




















5422 






6024 


1 










1 




























































































■ 

Probe 
SEQ ID 
NO: 




1904 


1 




38031 




4545 


1 


1 


5503 




E 


i 


s 






1 


6776 


1 


8400 


i 


i 


10781 


i 




i 


2678 


2910 


2910 


1 


41771 


4208 
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It 






















Top Hit Descriptor 


VB1 1 b03.s1 Scares infant brain 1 NIB Homo sapiens cDNA clone IMAGE:31 81 4 3' 


AV747880 NPC Homo sapiens cDNA clone NPCBDF10 5' 


1 Bacillus subtilis complete genome (section 21 of 21 ): from 3999281 to 421481 4 


i 
i 

i 
< 

a 

1 
■a 

1 
i, 

H 


Methanoooocus Jannaschii section 77 of 150 of the complete genome 


|6021 33261 Fl NIH MGC 81 Homo sapiens cDNA clone iMAGE:4288238 5' 




Campylobacter j^uni NCTC1 1 1 68 complete genome; segment 3/6 


1 CM2-HT01 37-200999-01 0-e08 HT01 37 Homo sapiens cDNA 


1 Zea mays ZMPMS2 gene for 1 9 kDa zein protein 


> 

z 

0 
5 

< 
0 

s 

0 

0 

0 


iHomo sapiens DNA for amyloid precursor protein, complete cds 


Drosophila melanogaster Dalmatian (dmt) mRNA, complete cds 


Mus musculus platelet derived growth factor receptor, beta polypeptide (Pdgfrb), mRNA 


1 

1 

E 

1 

i 
< 


1 
1 

E 

E 

1 

1 
< 






1 Homo sapiens chromosome 21 segment HS21C080 


icds 


Ovis aries partial JD2 gene for T cell receptor delta chain (TCRDJ2), exon 1 


Ovi's aries partial J D2 gene for T cell receptor delta chain (TCRDJ2), exon 1 




IEST382691 MAGE resequences, MAGK Homo sapiens cDNA 


Synechooystis sp. PCC 9413 transposase gene, complete cds 


[Homo sapiens chromosome 21 segment HS21C100 


IHVPOTHETICAL 49.7 KD PROTEIN IN GIN2-STE3 INTERGENIC REGION 


Campylobacter jejuni NCTC11168 complete genome; segment 2/6 




Gorilla gorilla carboxyl-ester lipase (CEL) gene, complete cds 


bp Hit 
itabase 
lource 


HUMAN 1 


HUMAN 1 




HUMAN 1 




HUMAN 1 






HUMAN 1 




SPROT 1 
























SPROT 1 


HUMAN 1 






0 
% 




1 








a 


















SWIS 
























i 
















Top HitAcession 
No. 


|R41 726.1 1 


IAV747880.1 | 




i 
i 


IU67535.1 1 


|BF674604.1 1 


i 


1 




1X58700.1 1 


§ 




|aF203478.1 | 


1 6679258] 


1 




i 


|AL163280.2 1 


IAL163280.2 | 


1 


|AJ277511.1 1 


IAJ277511.1 1 


IQ31849 1 


i 
i 


i 


|AL1 63300.2 


i 


IAL139075.2 ! 


IAW352188.1 


i 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 4.1E-01| 


1 4.1E-01J 


s 


1 4.1E-01I 


1 4.1E-01I 


s 


1 4.1E-01I 


1 4.IE-OII 


9 


S 


1 4.1E-01I 


1 4.1E-01 


|. 4.CE-01 


4.0E-01 


1 4.0E-01I 


1 4.CE-01 ] 


1 4.0E-011 


1 4.0E-01 




4.0E-01 


1 4.0E-01 1 


1 4.0E-01 


1 4.OE-OII 


i 


1 4.0E-01 


1 4.0E-01 


1 4.0E-01 j 


1 4.0E-01 


1 3.9E-01 


1 3.9E-01 


Expression 
Signai 




^ 


i 








^ 






S 






^ 1 




































ORF SEQ 
ID NO: 




1 242471 








26757 


27323I 






i 


28121] 




21079 






21743 


1 190311 








i 






s 














log 










16257 




g 












1 S 






12709 










i 




1 














112621 
































































tao 




4567 


5185 


i 


1 




1 


1 


i 


8213 




s 






§ 


§ 




1 


i 










5603 




9312! 


1 




Si 


s 
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Top Hit Descriptor 


60167S418F1 NIH MGC 21 Homo sapiens cDNA clone IMAGE:3958997 5' | 




Metfianobacterium tliermoautotropliicum from bases 702375 to 714311 (section 62 of 1 48) of ttie complete | 


|HomoEapiensfiHb5geneforliairkeratin,exons1 to9 | 


1 Escherichia cdi K-1 2 MG1 555 section 225 of 400 of the complete genome | 


Mus musculus Emrl mRNA, complete cds | 


xieoel 1 .x1 NOI_GGAP_Pan1 Homo sapiens cDNA clone 1MAGE:26791 16 3' eimilar to gb:K00558 TUBULIN 1 
ALPHA-1 CHAIN (HUMAN); | 


1 Mus musculus mannose receptor, C type 2 (Mrc2). mRNA . | 


Arabidopsie thaliana DNA chromosome 4, contig 1i-agment No. 77 | 


1 Homo sapiens GAP-iike protein (LOC51 306), mRNA | 


1 Homo sapiens GAP-Iike protein (L0C51 306), mRNA | 


1501811060R1 NIH MGC 48 Homo sapiens cDNA clone IMAGE:4053951 3' | 


|501894653F2N1H MGC 19 Homo sapiens cDNA clone IMAGE:4124244 5' | 


1 Rattus norvegicus ADP-ribosyiation factor-directed GTPase activaf ng protein mRNA, compiete cds | 


|zr08a09.s1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA clone IMAGE:550872 3' | 




Danio rerio homeobox protein (hoxbSb) gene, complete cds | 


|RC5-HT021&-181099-011-g02 HT0218 Homo sapiens cDNA | 


1 7881E1 fetal brain cDNA Homo sapiens oDNA clone 788iE1-K similar to R07879, Z40498 | 


1 Rat leukocyte common antigen (L-CA) gene, Kconsi through 5 | 


IS.sorofa mRNA for CD31 protein (PECAM-1) • | 


1 Homo sapiens tumor protein p53-binding protein, 2 (TP53BP2), mRNA | 


1 Homo sapiens tyrosine kinase non-receceptor 1 (TNK1), mRNA | 


IVOLTAGE-DEPENDENT N-TYPE CALCIUM CHANNEL ALPHA-IB SUBUNIT (CALCIUM CHANNEL, L 1 
|tYPE, ALPHA-1 POLYPEPTIDE ISOFORM 5) (BRAIN CALCIUM CHANNEL 111) (Bill) | 


|X.laevis gene for albumin including HP1 enhancer 1 


1 Cgriseus rhodopsin gene for opsin protein | 


1 Gallus gallus SPARC gene for osteonectin, promoter and exon 1 | 


1 Gallus gallus SPARC gene for osteonectin, promoter and exon 1 | 


Top Hit 
. Database 
Source 


EST HUMAN 1 












EST HUMAN 1 


2 


Z 






[EST HUMAN | 


|EST HUMAN | 




EST HUMAN | 


z 




IEST HUMAN 1 


[EST HUMAN | 


z 


z 


z 




IsWlSSPROT 


z 


z 






TopHitAcession 
No. 


BE902390.1 1 


i 


i 


IY19210.1 1 


|AE000335.1 1 




§ 

i 
< 


i 


IAL161581.2 1 


77061361 


1 77061361 


IBFI 29793.1 J 


IBF310688.1 1 


U35776.1 1 


1 


U05897.1 1 


|AF071 253.1 1 




i 

z 


1 

i 


1X98505.1 1 


1 114480421 




{002294 


1 


X61084.1 


IAJ243178.1 


IAJ243178.1 


Most Similar 
(Top) Hit 
Bl^STE 
Value 


3.6E-01I 


3.6E-01I 




s 


S 


3.6E-01| 


S 




3.5E-01I 


3.5E-01| 


3.5E-01 1 


3.5E-01 1 


3.5E-01 1 


i 




3.5E-01 1 




3.5E-01 




<? 


3.5E-01 


3.5E-01I 




S 


s 


3.5E-01 


1 3.5E-01 


1 3.5E-01 


Expression 
Signal 




1 




























i 




^ 








» 








§ 






ORF SEQ 
ID NO: 


284401 
















20414 




20467 


20531 1 












24056 


1 


1 245131 






1 27238] 




i 


28251 


28513 


% 


Exon 
SEQID 
NO: 




183261 


1 




i 


i 






10596 




i 


1 


i 








1 


14275 


1 


147301 


1 




i 


1 


1 








Probe 
SEQID 
NO: 




g 

s 


874 


9044 


9127 


9981 


i 


8 


i 


1 


1 


1 


1 


1622 




J 


f 


4379 


4790 


4849 


1 


6704 




7612 


1 






i 
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Top Hit Descriptor 


tm63g05.x1 NCI_CGAP_Brn25 Homo sapiens cDNA clone IMAGE:2162840 3' similar to gb:S37431 1 
LAMININ RECEPTOR (HUMAN); | 


Q 

•1 

1 

o 
1 
1 
■ffi 

E 


Bovine enterovirus strain K2577, complete genome | 


i INTEGRIN BETA-8 PRECURSOR | 


1 INTEGRIN BETA-8 PRECURSOR | 


1 Saccliaromvces cerevisiae Mafi p (MAF1 ) gene, complete cds | 


1 
1 

1 
1 

8 
1 
1 


Homo sapiens FAA gene, exon 16, 1 7 and 18 | 




IPROBABLEE4 PROTEIN | 




1 Human vcn Willebrand factor gene, exons 35 and 37 | 


1 

1 
1 
1 

i 

i 
1 




Arabidopsis thaliana DNA chromosome 4, contig fragment No. 27 | 


1 Citrus variegation virus putative replicase gene, partial cds | 


S.cerevisiae RIBS gene encoding Riboflavin synthase | 


Schizosaccharomyces pombe CviffSp (cwf8) gene, complete cds 1 


[Human autoantigen mRNA, complete cds | 


hv42h08,x1 NCI_CGAP_Lu24 Homo sapiens cDNA clone IMAGE:3176127 3' similar to contains PTR5.t3 1 
PTR5 repetitive element; | 


Beta vulgaris mitochondrion, complete genome | 


i 

1 

1 
1 

1 


(Bf), and complement component C2 (02) genes,> 


Rhizobium leguminosarum sym plasmid pRL5J 1 nodX gene | 


1 

1 

1 

i 
1 
1 
.1 
1 


Arabidopsis thaliana DNA chromosome 4, contig fragment No. 45 | 


iHomo sapiens KIAA1100 protein (KIAA1100), mRNA | 


IPROLINE-RICH PROTEIN LAS17 | 


Top Hit 
Database 
Source 


EST HUMAN | 


|EST_HUMAN | 


iNT 1 


ISWISSPROT 1 


ISWISSPROT 1 


iNT 1 


NT 1 


iNT 1 


T 


ISWISSPROT 1 




Z 


z 


INT 1 


iNT 1 


Int 


|NT 1 


iNT 1 


iNT 1 


EST HUMAN | 


iNT 1 


iNT 1 


NT 


NT _ 


NT 


NT . 


NT 


ISWISSPROT 1 


TopHitAcession 
No. 


< 


1 


1 96336241 


IP26013 1 


i 


IU19492.1 1 


IU19492.1 1 


IAJ225084.1 1 




S 
§ 


|aF045981.1 1 


IM2585S.1 1 


IM25855.1 1 


|AB035507.1 | 


< 




1221621.1 1 


IAF254351.1 | 


IL26339.1 1 


|bE21865Z1 1 




IAJ297131.1 1 


AF019413.1 


1X07990.1 1 


i 


< 




IQ12445 1 


Most Similar 
(Top) Hit 
BLAST E 


3.4E-01 


3.4E-01 


3.4E-01 


3.4E-01 


3.4E-01 


3.4E-01 


3.4E-01 


3.4E-01 


3.4E-01 


3.4E-01 


3.4E-01 


s 


3.4E-01 


3.4E-01 


3.4E-01 


3.4E-01 


3.4E-01 


S 


3.4E-01 


S 


3.4E-01 


3.4E-01 


3.4E-01 


3.3E-01 


3.3E-01 


3.3E-01 




, 3.3E-01 


h 

r 










ri 




4.17| 


^ 


4.09 




.J 




1.77| 










1.82| 




? 


2.13| 




1.79 




I 3.92| 


1.16 






ORF SEQ 
ID NO: 


1 26161 1 






1 273901 


273911 




i 








28572 


287421 


287431 




i 


















1 167911 


1S791 


u 


1 203661 


1 20940 1 


Exon 

SEQ ID 
NO: 






















§ 






§ 






1 




§ 




i 


1 




1 


00001 




i 


1 


























































Probe 
SEQ ID 
NO: 






1 71391 


7313 


7313 


7433 


7433 


7690 




8407 


8440 


1 


i 


8790 




9020 


1 91301 


1 


1 




9432 


9537 


i 






1 
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Top Hit Descriptor 


6021 8401 6T1 NIH_MGC_42 Homo sapiens cDNA clone 1MAGE:4300251 3' 


Mus musculus disintegrin 5 (Dtgn5), mRNA 1 


EST36722 Embryo. 8 week 1 Homo sapiens cDNA 5' end 1 


decarboxylase) (UMPS) mRNA 


Bacteriophage phi-Ye03-12 complete genome 1 


INTERLEUKIN-12 ALPHA CHAIN PRECURSOR (IL-12A) (CYTOTOXIC LYMPHOCYTE MATURATION 1 
FACTOR 35 KD SUBUNIT) (CLMF P35) 




Homo sapiens MTA1-L1 gene, complete cds | 


> 
1 
0 

0 

1 


Arabidopsis thaliana DNA chromosome 4, contig fragment No. 1 0 | 


Hypoxyion fragiforme chitin synthase gene, partial cds 1 


Rattus norvegicus DNA for regucalcin, partial cds- | 


tp78b12.x1 NCI CGAP_Ut3 Homo sapiens cDNA clone IMAGE.2205407 3' similar to gb:X57522 ANTIGEN 1 
PEPTIDE TRANSPORTER 1 (HUMAN); I 


R.norvegicus mRNA for 3'UTR of ubiquitin-like protein | 


R.norvegicus mRNA for 3'UTR of ubiquitin-like protein | 


601 472768T1 NIH_MGC_58 Homo sapiens cDNA clone IMAGE:3875753 3' | 


501472768T1 N1H_MGC_S8 Homo sapiens cDNA clone IMAGE:3875753 3' 


ty84h01 .x1 NCI_CGAP_Kid1 1 Homo sapiens cDNA clone IMAGE:2285809 3' similar to contains Alu 
repetitive element.contains element L1 repetitive element ; | 


ty84h01 .x1 NCI_CGAP_Kid1 1 Homo sapiens cDNA clone IMAGE;2285809 3' similar to contains Alu 1 
repetitive element;contains element LI repetitive element ; | 


J2498F Human fetal heart. Lambda ZAP Express Homo sapiens cDNA clone J24S8 5' similar to TEGT \ 


602140372F1 NIH MGC 45 Homo sapiens cDNA clone IMAGE:4301800 5' 


i 

§ 
0 

i 
t 
1 

i 

i 

1 

i 
g 

1 


RC4-TN0C77-250S0O-01 1 -g04 TN0077 Homo sapiens cDNA 1 




D.mauritianaAdh gene 1 


D.mauritianaAdh gene 1 


i 

1 

1 
< 

1 
i 
s 

0 

s 


hv51 g02.x1 NCI CGAP Lu24 Homo sapiens cDNA clone IMAGE:31 76978 3' | 


Top Hit 
Database 
Source 


EST HUMAN | 


NT 1 


EST HUMAN | 




NT 1 


SWISSPROT 1 


NT 1 


NT 


SWISSPROT 1 


NT 


NT 


z 


EST HUMAN | 


INl 


NT 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN 


EST HUMAN | 




EST HUMAN | 


EST HUMAN | 


EST HUMAN 1 




NT 1 


NT 




EST HUMAN 


Top Hit Acession 
No. 






AA332734.1 | 


4507834I 


AJ251 805.1 1 


0 


AJ007932.2 1 


AB01 2922.1 1 




AL1 61 498.2 1 


AF200446.1 1 


1031662.1 1 


A1539114.1 1 


i 


i 


BE619650.1 


|BE61 9650.1 1 


AI628131.1 


|aI628131.1 


N85146.1 






IBF37S745.1 


1L41 044.1 




i 


IBF526499.1 


|BE21 9351.1 


Most Similar 
(Top) Hit 
BLAST E 


3,3E.01 


3.3E-01 


3.3E-01 


9 


3.3E-01 


3.3E-01 


3.3E-01 


9 


3.3E-01 


3,3E-01 


3.3E-01 


3,3E-01 


i 


3.3E-01 


3.3E-01 


9 


3.3E-01 


3.3E-01 


3.3E-01 


3.3E-01 


3,3E-01 


9 


3.3E-01 


3.3E-01 


9 


3.3E-01 


3.3E-01 


9 




4.82 






J 












1.59 




274 




§ 










S 




19.46 


^ 




2.07 








12.76 


ORF SEQ 
ID NO: 


21046 


21352 






22648 




22737 


23173 


23433 


23582 


23609 






1 


i 




i 






1 


27053 


En 


27504, 




1 


28243 




28700 


Exon 
SEQ ID 
NO: 


1193 


1 


8 






2910 


i 


3367 






i 


4170 




i 




5537 




§ 






§ 




g 






S 


1 


"8431| 


























































Probe 
SEQID 
NO: 


■l285 


1588 


1704 


.J 


1 


1 


s 




373« 


i 


3920 


4271 








5622 


5622 


J 


6117 


6575 


1 


7477 




7739 


8104 


8104 




1 8561 
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(35 KD 
ECTIN 




















































§ 




Top Hit Descriptor 


GALECTIN-3 (GALACTOSE-SPECIFIC LECTIN 3) (MAC-2 ANTIGEN) (IGE-BINDING PROTEIN) 
L-29) (CBP30) 


|ob71g02.s1 NCi CGAP GOBI Homo sapiens cDNA clone IMAGE:1336850 3' 




1 Homo sapiens aldehyde oxidase 1 (A0X1), mRNA 


iPyrococcus horikoshii 0T3 genomic DNA, 287001-544000 nt. position (2/7) 


1 Rattus norvegicus EH domain binding protein Epsin mRNA, complete cds 






P.vulgaris aic5-1 gene 


LACTOSE PERMEASE (U\CTOSE-PROTON SYMPORT) (LACTOSE TRANSPORT PROTEIN) 


1 EST369264 MAGE resequences, MAGD Homo sapiens cDNA 


|EST369264 MAGE resequences, MAGD Homo sapiens oDNA 




|601858804F1 N1H_MGC_17 Homo sapiens cDNA clone IMAGE:4111512 5' 


E 

1 

1 

1 

i 
1 

1 

i 


|Homo sapiens promyelocytic leukemia zinc finger protein (PLZF) gene, complete cds 




|Arabidopsis thaliana DNA chromosome 4, contig fragment No. 46 






[HYPOTHETICAL 81.7 KD PROTEIN C13G7.04C IN CHROMOSOME 1 PRECURSOR 




Mouse renin (Ren-1-d) gene, complete cds 


1 Homo sapiens interleul^in 12 p40 subunit (1L12B) gene, IL12B-1 allele, complete cds 


|CM0-HT0569-OS0300-269-f10 HT0569 Homo sapiens cDNA 


1 Rat ISO-atrial natriuretic factor gene, complete cds 






l|i 


1 


lUMAN 1 
















>PROT 


lUMAN 


lUMAN 




lUMAN 1 














5PROT 1 


fUMAN 1 






lUMAN 1 






lUMAN 1 




i 






z 


z 


z 


z 


z 


z 






w' 


z 




z 


z 


z 


z 




z 






z 






z 


z 




TopHltAcession 
No. 


P47953 


i 

s 


i 


1 


1 


i 


|AL1 61 551.2 1 


|AF047013.1 1 


|Z50202.1 1 


a ? 


AW957194.1 


|AW957194.1 1 


|AL1116S5.1 1 


|BF2a3817.1 1 




§ 


5 


|AL161546.2 1 


5 

i 




i 


i 


|M3235Z1 1 


|AY008847.1 1 


i 
s 


|M60266.1 1 


1x02508.1 1 


|BF31 1635.1 1 


Most Similar 
CTop)Hit 
BLAST E 
Value 


i 


S 


s 


1 3.3E-01 


i 




1 3.2E-01 


1 3.2E-01 


■? 


3.2E-01 


i ^ 


i 


i 


i 


s 


[ 3.2E-01 


i 


i 


i 




i 


i 






i 3.2E-01 




3.2E-01 


j 3.2E-01 


Expression 
Signal 










§ 


2 




§ 




i: 


^ s 






g 








§ 














g 








ORF SEQ 
ID NO: 






i 










1 20902] 




211301 


21513 


1 215141 


\ 215741 


i 




22433 1 








24052 


i 






24771 


1 249131 


i 




i 


Exon 
SEQ ID 
NO: 


1 


s 


1 






i 


i 


§ 






3 i 


1 


i 












i 










§ 

s 


1 






i 






























































i 


i 




§ 


1 








8 P 






§ 




i 




i 


i 


i 






i 


1 








i 


i 
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Top Hit Descriptor 




Human monocyte antigen CD14 (CD14) mRNA, complete cds | 




Homo sapiens gene for AF-6, complete cds | 


1 EST04702 Fetal brain, Stratagene (cat#936206) Homo sapiens cDNA clone HFBDZ21 | 


1 Drosophila melanogaster laminin A (Lam-A) mRNA, complete cds ■ 1 


[ELONGATION FACTOR TU(EF-TU) | 




601275480F1 NIH_MGC_20 Homo sapiens cDNA clone IMAGE:351 5746 5' 1 


ye90h06.r1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE;125051 5' similar to 1 
gb:M64241 QM PROTEIN (HUMAN); 


Homo sapiens KIAA01 74 gene product (KIAA01 74), mRNA 1 


Homo sapiens KIAA01 74 gene product (KIAA01 74), mRNA | 


hi46ii08.x1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone IMAGE:29753gi 3' 1 


Mus musculus gene for Ser/Thr kinase KKIAMRE, exon 6 | 


Daucus carota mRNA for transcription factor E2F (E2F gene) | 


nm61hC5.s1 NCLCGAP_Br3 Homo sapiens cDNA clone IMAGE:1072761 3' | 


Homo sapiens hepatocyte nuclear factor-3 alpha (HNF3A) gene, exon 1 | 


Mus musculus mRNA for polycystin | 


Homo sapiens filamin 2 (FLN2) gene, exons 10 through 22 | 


z 

i 
1 
1 

X 

1 
1 

0 


601 3051 21 F1 N1H_MGC_39 Homo sapiens cDNA clone IMAGE:3S40420 5' 


ra46f01.s1 Soares infant brain 1NIB Homo sapiens oDNA clone 1MAGE:35639 3' | 


602124743F1 NIH_MGC_56 Homo sapiens cDNA clone 1MAGE:4281611 5' 1 


602124743F1 NIH_MGC_56 Homo sapiens cDNA clone 1MAGE:4281611 5' | 


qjS1e11.x1 NCI CGAP Kid3 Homo sapiens cDNA clone IMAGE:1S63980 3' similar to gb:S55700 1 
HYDROXYMETHYLGLUTARYL-COA LYASE PRECURSOR (HUMAN); | 


601883592F1 NIH MGC 57 Homo sapiens oDNA clone 1MAGE:4095814 5' | 


Homo sapiens KIAA0764 gene product (KIAA0764), mRNA | 


for mitochondrial product 






Top Hit 
Database 
Source 


r 


k 


iNT 1 


iNT 1 


[EST HUMAN 1 




ISWISSPROT 1 


NT 




EST HUMAN 


iNT 1 


INT 1 


lEST HUMAN 1 




iNT 1 


[EST HUMAN | 


iNT 1 


iNT 1 


iNT 1 


lEST HUMAN | 


lEST HUMAN 1 


|EST HUMAN | 


[EST. HUMAN 1 


lEST HUMAN | 


EST HUMAN | 


|EST HUMAN | 


iNT 1 






iNT - 1 


Top Hit Acession 
No. 


|AL1 61 574.2 1 


i 5 

i S 
C 5 


IU44914.1 1 


lABOl 1399.1 1 


IT06813.1 1 


i 

3 


|083217 1 


L39874.1 


|BE385776.1 1 


R1 8051.1 


1 


1 76619711 


|AW629036.1 1 


i 

g 


IAJ251 586.1 1 


IAA576308.1 | 




IY13278.1 1 


IAF184122.1 1 


|AW983549.1 1 


IBE737392.1 | 


IR45318.1 1 


1 
I 


IBF696639.1 | 


|a1244001.1 1 


IBF216117.1 1 




AF294308.1 


|AF304162.1 1 


|AF1 95953.1 | 


MostSimilar 
(Top) Hit 
BI.ASTE 
Value 


3.2E-01 


1 3.2E-01 


s 


1 3.2E-01 


1 3.2E-01 


1 3.2E-01 


1 3.2E-01 


3.2E-01 


S 


IE-01 


lE-01 


IE-01 


IE-01 


9 


IE-01 


IE-01 


IE-01 


IE-01 


IE-01 


9 


IE-01 


9 


9 


IE-01 


IE-01 


IE-01 


IE-01 


IE-01 


9 


IE-01 












































•-15 

1* 









3.25 














1 345| 


1 3.451 








i 


1 


1 45.031 






IN 


1 1.871 










s 


s 


s 




ORF SEQ 

ID NO: 










§ 








1 


22397! 


2241 9| 


22420 1 






S3 


2471 7| 






25510] 






27092| 




27875 




28348 










Exon 
SEQ ID 
NO: 


1 


171241 


17591 


17760 


i 


i 


1 




i 


12503 


i 


1 


12756 




13744 


14943 






15444 




1 


1 


1 


1 


1 


1 




i 


19007 


S 






























































Probe 
SEQID 
NO: 


11 


7247 




7910 


8058 


9152 


1 


i 


i 


1 


i 


1 


1 


3136 


3832 


1 50731 


1 


5447 


5527 


1 


6160 


1 70231 


7791 


7791 




8211 


i 


i 


9315 


1 



152 
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Top Hit Descriptor 


Homo sapiens transcription factor IGHM enhancer 3, JM1 1 protein, JM4 protein, JM5 protein, T54 protein, 
complete cds; and L-type calcium channel a> 


1 Mus musculus protein kinase C, epsilon (Pkce), mRNA 1 


1 Homo sapiens Xq pseudoautosomal region; segment 1 12 \ 


|xs63f08.x1 NCLCGAP_Krd1 1 Homo sapiens cDNA clone 1MAGE:2774343 3' 1 


— , — , 




|PM1-ST0262-261199-001-g01 ST0262 Homo sapiens cDNA | 




|S01594960F1 NIH_MGC_9 Homo sapiens cDNA clone IMAGE:3948734 5' | 


1 RC3 BT0333-1 80700-1 1 1 -a03 BT0333 Homo sapiens cDNA | 


1 RC3-BT0333-1 80700-1 1 -a03 BT0333 Homo sapiens cDNA | 


1 Mus musculus 1 29/sv Clara cell 1 0 kd protein (mCCI 0) gene, complete cds | 


Mouse oytokeratin 1 5 gene, complete cds 


1 Mus musculus midnolin (Midn-pending), mRNA 1 


1 Streptococcus pneumoniae strain DBL5 PspA (pspA) gene, partial cds 1 


1 Thermotoga maritima section 67 of 1 36 of the complete genome | 


(Clecsf9), mRNA 


|601339079F1 NIH_MGC_53HomoEapienscDNAclonelMAGE:3681594 5' | 


1 Asperglius oryzae bipA gene for ER chaperone BiP, complete cds | 


|yp84b10.r1 Soa-es fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:194107 5' | 


ypB4b1 o n Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone IMAGE:1 94107 5' 


1 Rattus norvegicus mRNA for glyceraldehyde-3-phosphate dehydrogenase type 2 (gapdh-2 gene) 1 


1 Mus musculus ribose 5-phosphate isomerase A (Rpia), mRNA 1 


1 Aquifex aeolicus section 68 of 1 09 of the complete genome | 




IXenopus laevis transcription factor E2F mRNA, complete cds | 


|PM1-CT0325-1 71 299-001 -fl 2 CT0326 Homo sapiens cDNA | 


1 PM1 -CT0325-1 71 299-001 -fl 2 CT0326 Homo sapiens cDNA | 


TopHH 
Database 
Source 








EST HUMAN 


E 




|EST HUMAN 




1 
5 


EST HUMAN 
















|EST HUMAN | 




|EST HUMAN | 


EST HUMAN 












EST HUMAN | 


EST_HUMAN | 


Top HitAcession 
No. 


1 
< 




|AJ271 735.1 1 


|AW300400.1 1 




AF237778.1 


|AB030481.1 1 


< 


|AJ006755.1 1 


)BE741629.1 | 




|BE693575.1 | 


s 


1016313,1 1 


1 109470071 


< 


IAE001 755.1 1 


i 




IAB030231.1 1 




i 


i 


g 


IAE000736.1 1 


|AF222718.1 | 


IAF078111.1 1 


1 AW 754239.1 | 


1 AW 754239.1 | 


Most Similar 
(Top) Hit 
BLAST E 
Value 




s 


3.0E-01 


S u 


J S 


3.CE-01 


s 


i 


s 


s 


S 


s 


S 


3 


S 


3.0E-01 


s 


3.0E-01 


3 




3 




3 


2.9E-01 


3 


2.9E-01 


2.9E-01 


2.9E-01 


Expression 
Signal 






1 11.121 




! 1 


i 






























§ 








i 








ORF SEQ 
ID NO: 




i 


1 200371 


§s 


I i 








i 


1 251471 


1 














1 27007| 


i 


1 29074] 


i 






s 


i 








c S .. 




1 












1 


15205 


1 


i 


s 






1 


90991 




i 






18785 


1 






1 


13072 
































































1 

Probe 
SEO ID 
NO: 


i 




5 


i ! 


I 1 


1 3175| 


1 37891 


1 




1 53791 


1 53791 


1 




i 


1 


1 




i 


i 


i 




9564 


1 


1 19781 




1 31471 


1 32131 


i 
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Top Hit Descriptor 


|fp21a11.x1 NCI_CGAP_Gas4 Homo sapiens cDNA clone IMAGE:2188412 3' similar to gb:D15050 NIL-2-A 1 
|ZINC FINGER PROTEIN (HUMAN);contains element L1 repetitive element ; | 


|wr02f10.x1 NCLCGAP_GC5 Homo sapiens cDNA clone IMAGE:2480395 3' | 


|zs57d12.r1 NGLCGAP_GCB1 Homo sapiens cDNA done I^MGE:701 591 5' similar to contains Alu 1 
1 repetitive element; | 


1 Mus musculus SKDI (Skdl) gene, complete cds 1 


|Mus musculus SKD1 (Skdl) gene, complete cds | 


1 Mus musculus gene, complete cds, similar to EXLM1 | 


|yf77e12.s1 Scares Infant brain 1NIB Homo sapiens cDNA clone IMAGE;28291 3' | 




system pdypeptidesP16,18,28.30 and levanase | 


|Mus musculus Eph receptor A8 (Epha8), mRNA | 


iBos taurus myosin 1 mRNA, complete cds 1 


Mus musculus Fliih protein (Flilh) gene, complete cds; and LIglh protein (LIglh) gene, partial cds | 


1 PUTATIVE MULTICOPPER OXIDASE YDR506C | 


Mus musculus major histocompatibility locus class II region; Fas-binding protein Daxx(DAXX) gene, partial 


601065a30F1 N1H_MGC_10 Homo sapiens cDNA clone IMAGE:3452287 5' 1 


|601065830F1 NIH MGC 10 Homo sapiens cDNA done IMAGE:3452287 5' | 


jTrvpanosoma crua stage-specific surface glycoprotein gp82 (gp82) mRNA, partial cds | 


I Torpedo callfornlca mRNA encoding acetylcholine receptor gamma subunit | 


I Torpedo californica mRNA encoding acetylchdine receptor gamma subunit ■ | 


ny35h02.s1 NCI CGAP Pr12 Homo sapiens oDNA clone 1MAGE:1273779 similar to contains LTR8.t2 LTR8 
repetitive element; 




|wz88f05.x1 NCI CGAP Brn25.Homo sapiens oDNA clone IMAGE:2565921 3' similar to contains element 1 
MER29 repetitive element; 


iHomo sap-ens TN F-a-inducible RNA binding protein (TIRP) gene, complete cds | 


|601482059F1 NIH_MGC 68 Homo sapiens cDNA done IMAGE:3884559 5' | 


Chlamydomonas reinhardtii mRNA for nitrite reductase structural locus ( 


1 Chlamydomonas reinhardtii mRNA for nitrite reductase structural locus | 


Top Hit 
Database 
Source 


Iest human 


EST HUMAN 


Iest human 








Iest human | 












ISWISSPROT 1 




Iest human | 


|est_human 1 








EST HUMAN 1 




EST HUMAN 




lEST HUMAN | 






Top Hit Acession 


z 


i 

1 














1x56098.1 1 




1 


|U03420.1 1 


[AF142329.1 | 


IQ04399 1 


AF1 00956.1 


IBE540422.1 1 


|BE540422.1 | 


|AF1 28843.1 | 


IV01394.1 1 


i 
1 




IAL139078.2 1 


AW005671.1 


IAF092453.1 | 


IBE788199.1 1 




i 


|AW002902 


|aA284468/ 


i 




|AB019029.- 


H/lost Similar 
(Top) Hit 


BLAST E 
Value 


<? 


1 2.9E-01 1 


1 2.9E-01 


s 


1 2.9E-01 1 


i 


1 2.9E-01 1 


1 2.9E-0l| 


2.9E-0l| 


t 


1 2.9E-01| 


[ 2.9E-01 1 


i 


2.9E-01 


9 


1 2.9E-01 1 


1 2.9E-01I 




1 2.9E-01I 


i 


1 2.9E-01I 


2.9E-01 1 


2.9E-01 1 


i 


I 




1 


Signal 










d 


























§ 














1 




ORFSEQ, 


Q 






24059 


1 


1 240651 


1 24401 1 




25504 


25505 


1 2551 3 1 


1 


[ 24847 1 


t 26234 1 


26267 


i 




1 28387 1 








i 








g 




Exon 
SEQ ID 
NO: 




I 


1 


i 


s 








15440 






15128 


§ 








18147 




18375 




S 


E 


s 


19212 


8 


1 






















































Probe 
SEQ ID 
NO: 


1 3821 


i 


1 


i 


4388 


4729 


5218 


J 


1 5522! 


1 


5818 


1 81711 


1 


1 


i 


i 


1 


8502 


8502 


i 


1 


i 


9602 




i 


i 
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Top Hit Descriptor 


qp48h01 .x1 NCI CGAP CoS Homo sapiens cDNA clone IMAGE:1926289 3' similar to gb:X0S323 cds1 1 
MITOCHONDRIAL 60S RIBOSOMAL PROTEIN L3 (HUMAN); | 


qp48h01 jcl NCI CGAP CoS Homo sapiens cDNA clone IMAGE:1926289 3' similar to gb:X06323 cdsl 1 
MITOCHONDRIAL 60S RIBOSOMAL PROTEIN L3 (HUMAN); 






Mus musculus centrin (Cetn2) gene, complete cds 1 




Homo sapiens hypottietical protein (LOC51319), mRNA ( 


601880794F1 NIH MGC 55 Homo sapiens cDNA done 1MAGE:4109350 5' | 


601880794F1 NIH_MGC_55 Homo sapiens cDNA done IMAGE:4109350 5' | 


601852148F1 NIH_MGC_56 Homo sapiens cDNA done IMAGE:407S026 5' 1 


Drosophila heteroneura fruitless (fru) gene, alternative splice products, 5' flanking region, exons 1 through 7 1 
and complete cds | 


602137418F1 NIH MGC 83 Homosaplens cDNA done IMAGE:4273853 5' | 


Mus musculus DNA for prostaglandin D2 synthase, complete cds 1 




Homo sapiens GDC42-binding protein kinase beta (DMPK-like) (CDC42BPB), mRNA | 


Rattus norvegicus CDK104 mRNA | 


2x39b10.s1 Soares_total_fetus_Nb2HF8_Sw Homo sapiens cDNA clone 1MAGE:788827 3' similar to 1 




1 

i 

M 
1 


i 

1 

1 

Q 

1 
g. 

1 
1 

i 
i 

1 

1 


GA(3 POLYPROTEIN [CONTAINS; INNER COAT PROTEIN P12; CORE PROTEIN P15; CORE SHELL 1 
PROTEIN P30; NUCLEOPROTEIN P10] 






ta43c1 1 .x2 NCI_CGAP_Lu25 Homo sapiens cDNA done IMAGE:2046836 3' similar to contains element LI 1 

repetitive element ; | 


CM1 -HT0875-060900-385-e05 HT0875 Homo sapiens cDNA | 


W092e1 1 .x1 NCI_CGAP_Kid11 Homo sapiens cDNA done IMAGE:2462828 3' | 


Homo sapiens DiGeorge syndrome critical region, telomeric end | 


Tritlcum aestivum (Wcs66) gene, complete cds . | 


Top Hit 
Database 
Source 


EST HUMAN | 


EST HUMAN 




i 








EST HUMAN | 


EST HUMAN 1 


? 




estjhuman 1 




EST HUMAN 1 






i 






EST HUMAN | 


IsWISSPROT 1 






EST_HUMAN 




EST_HUMAN 1 






TopHitAcession 
No. 


|aI346126.1 | 




^U51688.1 1 


I 




IL13654.1 1 


1 


m 


BF241 062.1 1 


1 
m 


|aF051 662.1 1 


i 






1 


i 


|aA450061.1 1 


1 


X79815.1 1 




P03341 


< 


i 

? 


i 
1 


i 

e 

m 


< 


L77569.1 1 


L27518.1 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 2.8E-01 


i 


2.8E-01 


i 


2.8E-01 


2.8E-01 


S 


S 


2.8E-01 


2.8E-01 


2.8E-01 


2.8E-01 


2.8E-01 


2.8E-01 


3 


2.7E-01 


S 


S 


S 


3 


3 


2.7E-01 


2.7E-01 


s 


2.7E-01 


3 




S 


Expression 
Signal 






























§. 






































ORFSEQ 
ID NO: 


1 26781 1 














1 


8 


i 


26382I 






252361 












214711 


21507 




220941 


§ 




236181 


236261 




Exon 
SEQ ID 
NO: 


1 


i 








i 


1 


S 




i 


18134 


1 


E 




E 


i 






1 


§ 


11639 






?i 






i 


i 
























































Probe 
SEQ ID 
NO: 


6713 


i 


67781 


i 


e 


1 


77891 


8126 


81251 






1 


1 




1 


1 




1 


i 


i 


1 










39291 


1 


4795 
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Top Hit Descriptor 




1 ADENYUTE CYCLASE (ATP PYROPHOSPHATE-LYASE) (ADENYLYL CYCLASE) | 


gene encoding mitochondrial protein, mRNA | 


gene encoding mitochondrial protein, mRNA | 




Imus musculus ICR/Swiss glyceraldehyde 3-phosphate dehydrogenase (Gapd-S) gene, complete cds 1 


Ureaplasma urealyticum ssotion 57 of 59 of the complete genome | 


1 

1 
1 

i 

.i 
i" 
i 

1 
§ 

1 

1 


Botrytis cinerea strain T4 cDNA library under condittons of nitt-ogen deprivation | 


Homo sapiens hyperpolarization activated cyclic nucleotide-gated potassium channel 4 (HCN4) mRNA 


|PM4-CT0400-310700-005^08 CT0400 Homo sapiens cDNA | 


lPM4-CT0400-310700-005-d08CT0400HomosapienscDNA 1 


Aqulfexaeolicus section 7 of 109 of the complete genome | 


jzs11a12.r1 NCI CGAP_GCB1 Homo sapiens cDNA clone IMAGE:684862 5' | 


B.taurus mRNlA for D-aspartate oxidase ] 


1 EST385464 MAGE resequences, MAGM Homo sapiens cDNA ] 


1 Danio rerio peptide YY precursor gene, complete cds 1 


Arabidopsis thaliana DNA chromosome 4, contig fragment No. 29 J 


|wg1 1C07.X1 Scares NSF F8 9W OT PA P S1 Homo sapiens cDNA clone IMAGE:2364780 3' | 


1 wgll o07,x1 Soares_NSF_F8_9W_OT_PA_P_S1 Homo sapiens cDNA clone IMAGE:2364780 3' | 


lA-AGGLUTlNlN ATTACHMENT SUBUNIT PRECURSOR | 


1 RHIB PROTEIN | 


iMus musculus neuronal apoptosis inhibitory protein 6 (Naip6) gene, complete cds; and Naip3 gene, exons 2-9| 
and 11-16 | 


1 MOLT-INH IBITING HORMONE PRECURSOR (MIH) | 


Choristoneura fumiferana diapause associated protein 2 (DAP2) mRNA, complete cds ] 


1 Vibrio cholerae chromosome 1 1 , section 73 of 93 of the complete chromosome ] 


Mus musculus annexin V gene, intron 4 segment containing 5' LTR and gag portion of MuERV-L (murine 1 
endogenous retrovirus) element [ 


1 601 437463F1 NIH_MGC^72 Homo sapiens cDNA clone IMAGE:3922500 5' ] 


Top Hit 

Database 
Source 




1 SWISSPROT 1 












[EST HUMAN 






|=ST HUMAN 


[EST HUMAN 




|EST HUMAN | 




|EST HUMAN | 








|EST HUMAN | 


1 SWISSPROT 1 


1 SWISSPROT 1 




iSWISSPROT 1 








EST HUMAN | 


TopHitAcession 
No. 


IAF141 325.2 | 


IQ01631 1 


45022961 


45022961 


IM26501.1 1 




|AE002156.1 1 




i 


4885406 


i 

i 


i 

s 


IAE000675.1 1 






1AW973471.1 ] 


1AF233875.1 j 


IAL161517.2 ] 


IAI741483.1 ] 


i 


1 


IQ03314 ] 


AF242431.1 | 


IQ27225 ] 


IAF007768.1 ] 




AJ230113.1 


iBE896785.1 ] 


MostSimiiar 
(Top) Hit 
BIJ^STE 
Value 


1 2.6E-01| 


1 2.6E-01| 


1 2.5E-0l| 




1 2.5E-01I 


2.5E-0l| 


1 2.5E-01I 


I 2.5E-01I 


2.5E-01I 


2.5E-01 1 


t 


1 2.5E-01| 


2.5E-01I 


9 


2.5E-01 ] 




1 2.5E-01 ] 


2.5E-01] 


9 


9 


2.5E-01] 


2.5E-011 


2.5E-0l| 


2.5E-01] 


9 


i 


9 


2.5E-01] 


Expression 
Signal 


i 












<P 






^ 


1 




S 










^ 






1 




i 










S 


ORFSEQ 
ID NO: 






1 20025 


1 20025 










i 






1 21613] 






1 223591 








1 


1 










1 


1 24351 1 




1 24398] 


Exon 
SEQ ID 
NO: 






10209 


i 






s 
§ 




1 


1 


12706 


12706] 


12237] 




1 




1 


13419| 


i 


i 


i 


14120] 


I 






i 


i 




Probe 
SEQ ID 

NO: 


1 


i 










1044 




1 


1697 


1 


1 


2357 


1 


1 


1 


1 


3502 


3774 


3774 


3977 


4222 


451 


1 


1 


i 


i 


i 
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Top Hit Descriptor 




hh75f09.y1 NCLCGAP_GU1 Homo sapiens cDNA cbne IMAGE:2968649 5' similar to contains TAR1 .t2 1 
TAR1 repetitive element; 


hh75f09.y1 NCI_CGAP_GU1 Homo sapiens cDNA cbne IMAGE:2968649 5' similar to contains TAR1 .t2 1 
TAR1 repetitive element; 


T3 receptor-associating cofactor-1 [human, fetal liver, mRNA, 2930 nf] | 


1 
1 

1 
i 

i 
1 

5 


Homo sapiens chromosome 21 segment HS21 0082 | 


7i57a03.x1 Soares NSF F8 9W OT PA P SI Homo sapiens cDNA clone II^GE:3525389 3' | 


601459238F1 NIH MGC 66 Homo sapiens cDNA clone IMAGE:3862809 5' | 


yq84f07.r1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:202501 5' | 


Homo sapiens matrix metalloproteinase MMP Rasi-1 gene, promoter region | 


Homo sapiens matrix metallaproteinase MMP Rasi-1 gene, promoter region | 


Hordeum vulgare receptor-like kinase LRK10 gene, partial cds | 


Hordeum vulgare reoeptor-like kinase LRK1 0 gene, partial cds | 


RC3-ST0186-130100-015-a07ST0186HomoEapienscDNA | 


xg40c1 0.x1 NCI_CGAP_Ut1 Homo sapiens cDNA clone IMAGE:2630034 3' similar to contains Alu repetitive 1 
element;contains element MSR1 repetitive element ; 


MouseLIMd LINE DNA | 


Human mRNA for K1AA0124 gene, partal cds 


Zea mays cellulose synthase-4 (GesA-4) mRNA, complete cds | 


Arabidopsis thaliana DNA chromosome 4, contig fragment No. 41 1 


Spodoptera frugiperda CALNUC mRNA, complete cds | 


on70d04.s1 Soares NFL T GBC SI Homo sapiens cDNA clone IMAGE:1562023 3' | 


602132442F1 NIH MGC 81 Homo sapiens cDNA clone IMAGE:4271 578 5' | 


Homo sapiens KIAA0851 gene (partial), XT3 gene and L-ZTFL1 gene | 


Homo sapiens KIAA0851 gene (partial), Xr3 gene and LZTFLI gene | 


s. 

t 

1 




i 

z 

on 

E 

f 
t 

i 

1 

1 
1 

1 




IMMUNOGLOBULIN AI PROTEASE PRECURSOR (IGA1 PROTEASE) | 


Aquffex aeolious section 12 of 109 of the complete genome | 


Top Hit 1 






EST HUMAN 1 


EST HUMAN 








i 


EST HUMAN | 


i 










EST HUMAN | 


EST HUMAN 












EST_HUMAN | 


EST_HUMAN | 














0 

i 


z 


: 


i 


AB011070.1 1 


|aW663183.1 | 


|aW663183.1 | 


383390.1 1 


AF134119.1 1 


AL1 63282.2 1 


§ 


BF038595.1 1 


H53236.1 1 


U89651.2 1 


U89651 2 1 


AF085164.1 1 


AF085164.1 1 


AW581997.1 1 


|aW152246.1 | 




D50914.1 1 


AF200528.1 | 


AL161541.2 1 


< 


AA936316.1 | 


BF576124.1 | 




1 


Y17293.1 1 


AF267753.1 | 


AF251 708.1 1 


AF11 1168.2 1 


i 


AE000680.1 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


2.5E-01 




2.5E-01 


2.5E-01 


2.5E-01 


i 


2.5E-01 


2.5E-01 


2.5E-01 


2.5E-01 


9 


2.5E-01 


S 


2.5E-01 


2.5E-01 




2.5E-01 


2.5E-01 




2.5E-01 


2.4E-01 


2.4E-01 


2.4E-01 


2.4E-01 


2.4E-01 




2.4E-01 


2.4E-01 


i 


2.4E-01 




Signal 




1 


1 








q 






































24.081 




». 






ORFSEQ 
ID NO; 


24421 1 


.J 


24803 1 


249621 








§ 


270871 


275541 


27555 


275471 


27548 




28IO5I 


§ 














S 


i 


21123] 






Eo 




220051 


Exon 
SEQ ID 
NO: 




5035 




5186 


1 


s 




i 


i 








7341 


i 


1 


i 


1 


i 






0482 




§ 


i 








i 


2010 


2101 






























































Probe 
SEQ ID 
NO: 


47501 


1 




i 




0099 


i 


1 


70201 


7481 




7522 


i 


781 4| 


8010 


i 


1 


9074 


91001 


1 












i 




20911 




2215 
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Top HH Descriptor 


C.familiaris romi gene | 


as27e12.x1 Barstead aorta HPLRB5 Homo sapiens cDNA clone IMAGE:2318446 3' similar to gb:X13238 1 
CYTOCHROME C OXIDASE POLYPEPTIDE VIC PRECURSOR (HUMAN); | 


as27e12.x1 Barstead aorta HPLRB6 Homo sapiens cDNA clone IMAGE:2318446 3' similar to gb:X13238 1 
CYTOCHROME C OXIDASE POLYPEPTIDE VIC PRECURSOR (HUMAN); | 


as42f1 2.X1 Barstead aorta HPLRB6 Homo sapiens cDNA clone 1MAGE:231 9887 3' similar to contains Alu 
repetitive element; | 


Glycine max resistance protein LM1 7 precursor RNA, partial cds j 


Mus musculus myosin XV (Myol 5), mRNA - 1 


601511573F1 N1H_MGC_71 Homo sapiens cDNA clone IMAGE:3912859 5' | 


zai 2e08.r1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone 1MAGE:292358 5' | 


Oxytrlcha nova macronuclear telomere-blnding protein alpha subunit (tel-alpha alanine version) gene, 1 
complete cds | 




MR0-HT0559-2404OO-O1 4-g1 1 HT0559 Homo sapiens cDNA | 




601546155R2NIH MGC 59 Homo sapiens cDNA clone IMAGE;41 02092 3' | 


Mus musculus partial mRNA for muscle protein 534 (mg534 gene) | 


Mus musculus partial mRNA for muscle protein 534 (mg534 gene) | 




Borrelia burgdorferi 2.9-6 locus, ORF-A-D genes, complete cds and REP+ gene, partial cds | 


HCOEST44 HT29M6 Homo sapiens cDNA clone HCoE44 5' 


chn1424.seq.F Human fetal heart, Lambda ZAP Express Homo sapiens cDNA 5' | 


PM4-SN0012-030400-001-b06 SN0012 Homo sapiens cDNA | 


xv21d07.x1 Soares_NFL T GBC_S1 Homo sapiens cDNA clone IMAGE;281 3773 3' similar to TR:Q9Z1 75 
Q9Z1 75 LYSYL OXIDASE-RELATED PROTEIN 2. ;contains PTR5.b2 TAR1 repetitive element ; 


501 507202F1 NIH MGC 71 Homo sapiens cDNA clone 1MAGE:3908689 5' | 




Rattus norvegicus mRNA for acid gated ion channel | 


Rattus norvegicus mRNA for acid gated ion channel | 


nao39h1 2.x1 Lupskl_sciatic_nerve Homo sapiens cDNA clone IMAGE:3395950 3' similar to contains element! 
MER38 repetitive element ; | 


Top Hit 
Database 




EST HUMAN | 


EST HUMAN | 


EST HUMAN | 


z 




EST HUMAN 1 


EST HUMAN j 






EST HUMAN 1 




EST HUMAN | 




z 






EST HUMAN 


EST HUMAN | 




EST HUMAN 


EST HUMAN | 


EST HUMAN 1 


z 




EST HUMAN | 


TopHitAcession 
No. 


X96587.1 1 


AI708840.1 1 


AI708840.1 1 


i 


AF1 75389.1 | 




BE888071.1 1 


N80983.1 1 




X52124.1 1 


BE1 73060.1 1 


AJ293261.1 1 


BF1 33577.1 | 


i 

3 


AJ250189.1 1 


AE002167.2 1 


1 


T27231.1 1 


1 


AW863940.1 | 


AW303623.1 


i 

a 


1 


3 


AJ006519.1 1 


BF475611.1 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


2.3E-01 




2.3E-01 




2.3E-01 


2.3E-01 


2.3E-01 


2.3E-01 


2.3E.01 


2.3E-01 


2.3E-01 


2.3E-01 


2.3E-01 


2.3E-01 


2.3E-01 


2.3E-ai 


9 


2.3E-ai 


2.3E-ai 


2.3E-01 


2.3E-01 


2.3E-ai 


i 


2.3E-01 


2.3E-01 


2.3E-01 


.9 _ 

i 1 

r 


S. 




















S 

cvi 


cvi 






S 
cvi 














1 






S 




ORF SEQ 
ID NO; 


25361 1 


1 


1 




26408| 










27732I 




278111 




1 




288301 










1 


249051 
































i 
















S 




i 






1 




























































Probe 
SEQ ID 
NO: 


1 


1 






i 


i 


i 


i 


i 


J 




77401 


7974 




8525 1 


1 


9144| 


9232 


1 


1 




i 


1 9407 


1 


1 


9793 
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1 

: 


|o2l4a10.x1 Soares_fetaUiver_spleen_1NFLS_S1 Homo sapiens cDNA clone 1MAGE:1675290 3' similar to 1 
TR:Q13040 Q13040 ATP-BINDING CASSETTE PROTEIN ; | 


1 Homo sapiens PPAR delta gene, promoter region | 




Fresh-wafer sponge Emf1 alpha collagen (COLF1 ) gene | 


602085608F1 NIH MGC 83 Homo sapiens cDNA clone IMAGE:4249969 5' | 


601 4S2629F1 NIH MGC 67 Homo sapiens cDNA clone 1MAGE:3866190 5' | 


PM2-HT0353-281 299-003-a1 2 HT0353 Homo sapiens cDNA | 


1 PM2-HT0353-2S1 299-0D3-a1 2 HT0353 Homo sapiens cDNA | 




1 Arabidopsis thaliana DNA chromosome 4, contig fragment No. 62 ' | 




iMus musculus ATP-binding cassette protein (Abcb8) mRNA, partial cds | 




Mus musculus mixed lineage kinase 3 (Mlk3) and two pore domain K+ channel subunit (Kcnk6) genes, 1 

1 complete cds | 


t Mus musculus MAP kinase kinase kinase 1 (Mekkl ) mRNA, complete cds | 


1 Mus musculus MAP tanase tonase kinase 1 (Mekkl ) mRNA, complete cds | 


i 
1 

¥ 

i 

i" 
% 
1 


1 
1 

i 
1 


i 

1 


[Human beta-cytoplasmic actin (ACTBP9) pseudogene | 


|zq87c05.r1 Stratagene hNT neuron (#937233) Homo sapiens cDNA clone IMAGE:648968 5' ' | 


|Human dystrophin gene | 


1 Mus musculus vinoulin gene, exon 3 | 


iHomo sapiens diaphanous (Drosophila, homolog) 2 (DIAPH2), transcript variant 156, mRNA | 


1 Synechocystis sp. PCC6803 complete genome, 1 9/27, 2392729-2538999 | 


1 

i 
1 

Q 

s 

i 
i 

> 


i 

i 

1 


1 

1 


1 Mus musculus nm23-M1 gene, promoter region | 


iThermotoga maritima section 25 of 136 of the complete genome | 


Top Hit 
Database 
Source 


:EST HUMAN | 


z 






EST HUMAN 1 


EST HUMAN 


jEST HUMAN 1 


lEST HUMAN 1 


z 




z 


z 


z 


z 




z 


z 






z 


lEST HUMAN | 










lEST HUMAN 1 








z 


TopHitAcession 
No. 


|aI052190.1 1 


1 


i 


i 




i S 

1 1 


1 


|BE1 55625.1 1 


|aF020503,1 1 




< 


IAF213391.1 1 


< 


< 


< 


lAF1 17340.1 1 


i 


- 


IZ54148.1 1 


Q 


IAA211215.1 1 


IM86524.1 1 


IL13299.1 1 




ID64000.1 1 


|AV756238.1 1 


|M24135.1 1 


IM2413S.1 1 


|AF1 551 43.1 1 


|AE001713.1 


Most Similar 
(Top) Hit 


BLAST E 

Value ' 


1 2.2E-01 


1 2.2E-01 




1 2.2E-01 




2,2E-01 


i 


S 




9 


1 2.2E-01 


1 2.2E-01 


1 2.2E-01 


2.2E-01 




1 2.2E-01 


1 2.2E-01 


S 


1 2.2E-01 


1 2.2E-01 


1 2.2E-01 


1 2.2E-01 


1 2.2E-01 


1 2.2E-01 


2.2E-01 


1 2.2E-01 


9 


1 2.2E-01 


1 2.2E-01 


1 2.2E-01 


! 


Signal 


1 








6.24 




























o 




(\i 








3.99' 












ORF SEQ 


6 

Q 








1 218291 


22131 














1 




1 


23840 


23841 


1 239331 




1 24078] 




8 






1 






1 


26343 




1 


Exon 
SEQ ID 
NO: 














12781 


















§ 
? 






1 












154361 


§ 








17034 






























































Probe 
SEQ ID 
NO: 




1545 


s 


2042 


2354 






1 


s 




3743 


1 40071 


4118 




i 


8 




4258 


9 


1 47191 


4724 


1 


4975 


5513 


5518 


i 
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Top Hit DescripiDr 


PM3-CT0263-241299-009-b07 CT0253 Homo sapiens cDNA 


1 Mus musculus deformed epidermal autoreguiatory factor 1 (Drosophila) (Deafi ), mRNA | 




1 za04f08.r1 Soares melanocyte 2NbHM Homo sapiens cDNA clone IMAGE.291 591 5' | 


URGE PROLINE-RICH PROTEIN BAT2 (HLA-B^ASSOCIATED TRANSCRIPT 2) 


nuclear gene for chloroplast product 


|601859724F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE:4100189 5' 1 


1 Drosophila 68C glue gene cluster | 


Homo sapiens H-2K binding factor-2 (LOC51580), mRNA | 


[601 446957F1 NIH_MGC__55 Homo sapiens cDNA clone IMAGE:3850670 5' | 


Homo sapiens chromosome Xq28 melanoma antigen family A2a (MAGEA2A), melanoma antigen family A12 
(MAGEA1 2), melanoma antigen family A2b (MAGEA2B), melanoma antigen family A3 (MAGEA3), caltractin 


[Vitis ^flnifera cuhivar Pinot Noir plasma membrane aquaporin (PIPIa) mRNA, complete cds | 


RC1 -CT0249-1 41 1 99-021 -g04 CT0249 Homo sapiens cDN A | 


I AV694801 GKC Homo sapiens cDNA clone GKCAHB02 5' 1 


[nm31e1 1 .s1 NCLCGAP_Lip2 Homo sapiens cDNA clone IMAGE:1061804 | 


[Arabidopsls thaliana DNA chromosome 4, contig fragment No. 16 | 


[ Chlamydia muridarum. section 45 of 85 of the complete genome 1 


[ Mus musculus interferon (alpha and beta) receptor 2 (Ifnar2), mRNA | 


Mus musculus interferon (alpha and beta) receptor 2 (Ifnar2), mRNA | 


[ok73e02.s1 NCI CGAP GC4 Homo sapiens cDNA clone IMAGE:1519610 3' similar to gb:K02765 1 
1 COMPLEMENT C3 PRECURSOR (HUMAN); | 


[60208312gF1 NIH MGC 81 Homo sapiens cDNA clone IMAGE:4247503 5' | 






[ IMMEDIATE-EARLY PROTEIN IE180 | 


IIMMEDIATE-EARLYPROTEINIEI8O 1 


Homo sapiens mRNA for KIAA1 21 5 protein, partial cds ) 


Homo sapiens pshsp47 gene, complete cds ( 


Homo sapiens hox1 1 p-oto-oncogene, exons 1 to 3 and hug-1 gene | 


Top Hit 
Database 
Source 


lEST HU^MN 1 


z 


[EST HUMAN | 


|EST HUMAN | 


SWISSPROT 




IeST HUMAN 1 






[EST HUMAN | 


H 
Z. 




[EST HUMAN 1 


[EST HUMAN 1 


[EST HUMAN | 


z 


z: 








IEST HUMAN | 


[EST HUMAN | 




z 


[SWISSPROT 1 


[SWISSPROT 1 


z 


z 


z 


TopHitAcession 
No. 




1 8393247| 


|BF376354.1 | 


|W02988.1 1 


2 : 


AF197941.1 


|BF206507.1 1 


1X01918.1 1 


1 77062151 


[BE870959.1 1 


U82671.2 


[AF188843.1 | 


[AW361098.1 1 


1 


§ 

i 


[AL161 504.2 1 


1AE002314 2 1 


1 


1 




1 


i 


6912445^ 




IP11675 1 


[P11675 1 


[AB033041.1 1 


[AB010273.1 1 


[AJ009794.1 1 


MostSimiiar 
(Top) Hit 
BUVSTE 
Value 


2.2E^1 


1 2.2E-01I 


1 2.2E-01 1 


1 2.2E-01I 


2.2E-0lj 


2.2E-01 


2.2E-01| 


j 2.2E-01 1 


\ 2.2E-01I 


[ 2.2E-0l| 


2.2E-01 


2.2E-01I 


[ 2.2E-01I 


1 2.2E-01I 


IE-01 


IE-01 


IE-01 


IE-01 


IE-01 


IE-01 


IE-01 


IE-01 


IE-01 


IE-01 


IE-01 


IE-01 


9 


IE-01 




<N 






oi 




















Expression 
Signal 


2.29 






















2.341 














1.241 








[ 5.05| 












ORF SEQ 
ID NO: 




27341 1 


27382| 


27421 1 


275501 


27720 


277941 




281491 








1 




207271 


207291 




i 


209371 




i 


226I3I 






236521 




241571 


1 


Exon 
SEQID 
NO: 


















1 








i 








s 
















13874 




1 


1 


























































Probe 
SEQ ID 
NO: 


7217 


7270 


1 7304 1 


1 7353 1 


K i 




7719 


1 87231 


ss 


1 90771 


9183 


i 


9379 


i 


i 


i 


1 11081 


1 11801 


1180 


1 1871 1 


2111 


i 


1 37361 


i 




1 42791 


4474 


i 
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Top Hit Descriptor 




Homo sapiens 14q32 Jagged2 gene, complete cds; and unknown gene | 


Methanococcus jannaschii section 67 of 1 50 of the complete genome | 


601449441F1 NIH MGC 65 Homo sapiens cDNA clone 1MAGE:3853330 5' | 


601449441 F1 NIH MGC 65 Homo sapiens cDNA clone 1MAGE:3853330 5' | 


H.sapiens Na+-D-glucosB cotransport regulator gene | 


Homo sapiens full length insert cDNA YH85A1 1 | 


HOMEOBOX PROTEIN GLABRA2 (HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-10) (HD-ZIP 1 
PROTEIN ATHB-10) | 


xpl 5b02j<1 NCLCGAP_HN9 Homo sapiens cDNA clone IMAGE:2740395 3' similar to contains element 1 

MER21 repetitive element; | 


CED-11 PROTEIN | 


Cparasiloa oapC gene | 


QV4-EN 0032-1 90500-223-e03 EN0032 Homo sapiens cDNA ' 1 






Homo sapiens putative psihHbD pseudogenefor hair keratin, exons 1 to 9 | 




1 

E 

i 
1 

1 
1 
1 

! 


Saccharomyces cerevisiae Hal5p (HALS) mRNA, complete cds 1 




PM1 -CT0247-1 41 099-001 -g06 CT0247 Homo sapiens oDNA 


Andes virus strain 01231 33 glycoprotein G1 and G2 precursor, gene, partial cds | 


1 
1 


Chlamydia trachomatis section 5 of 87 of the complete genome 1 


Arabidopsis thaliana root gravitropism control protein (P1N2) gene, complete cds | 




Salvelinus pluvius mRNA for transfemn, complete cds | 


Salvelinus pluvius mRNA for transferrin, complete cds 1 


Pimephales promelas liver gluooE6-6-phosphate-1-dehydrogenase mRNA, partial cds | 


ovSOal 0.s1 Soares testis NHT Homo sapiens cDNA clone 1MAGE:1643610 3' | 


Homo sapiens Ku70-binding protein (KUB3) mRNA, partial cds [ 


Top Hit 

Database 
Source 








|EST HUMAN | 


|EST HUMAN | 






O 


1 


0 

1 




|est_human 1 
















|EST HUMAN 1 


















|EST HUMAN | 




TopHitAcession 
No. 


c < 




1 


IBE871330.1 1 






IAF074990.1 | 


1 


1 


1 


i 


IBE825165.1 | 


i 
1 








1432540] 






§ 

i 


|AF028026.1 | 




|AE001278.1 1 


|AF0869a7.1 1 


IAF086907.1 | 


ID89088.1 1 




IAF206637.2 | 


IAI023592.1 1 


IAF078164.2 | 


|Y19216.1 










Most Similar 
(Top) Hit 
BljSiSTE 
Value 


1 2.0E-01 


S 


9 


9 


1 2.0E-01 


9 


1 2.0E-01 


9 


9 


9 


9 


9 


9 


9 


9 


9 


1 2.0E-01 


1 2.0E-01 


9 


1 2.0E-01 


2.0E-01 


9 


2.0E-01 


2.0E-01 


9 


1 2.0E-01 


9 


1 2.0E-01 


9 


9 
















1 




1 


i 
















s 












^! 




eg 






















ORF SEQ 
ID NO: 


1 21300 






1 21618| 








1 






236281 






i 


1 




t 254911 


1 257051 




259621 




i 






1 


s 


i 




1 25202| 








§ 










12789 






13560 


13853 


14358 




14847 


,4913 


i 


15428 


1 


i 


1 


16517 


1 






1 


i 


1 


19118 




§ 
Si 
































































Probe 
SEQ ID 
NO: 




i 




s 




1 


i 


1 3442I 




3646 


1 39451 




1 


4972 


1 


i 




1 


i 


i 


6637 




si 


1 7774] 


7774 


8214 




1 95031 


1 9747 1 


1 
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Top Hit Descriptor 


yh48h1 0.r1 Soares placenta Nb2HP Homo sapiens cDNA clone 1MAGE:133027 5' | 


E.dispar mRNA for hexokinase (hxk1) | 


601274S04F1 NIH MGC 20 Homo sapiens cDNA done IMAGE:3615768 5' | 


P.dumerilii histone gene cluster for core histones H2A, H2B, H3 and H4 | 


NEUROFILAMENT TRIPLET L PROTEIN (NEUROFILAMENT LIGHT POLYPEPTIDE) (NF-L) | 


Lymantria dispar nucleopolyhedrovirus, complete genome | 


Lymantria dispar nucleopolyt^edro^rus, complete genome | 




regulatory protein (hapR) gene, complete cds, and YRAL VIBCO gene, partial cds 


regulatory protein (hapR) gene, complete cds, and YRAL VIBCO gene, partial cds 




2 

1 
1 
1 


Naja naja atra ctx-1 gene, exons 1-3 | 


Taxus canadensis geranylgeranyl diphosphate synthase mRNA, complete cds | 


Anabaena sp. ORF4 (partal), ORF3, ORF2, 0RF1 , adpA gene, adpB gene, adpC gene, adpD gene, adpE 1 
igene and adpF gene 


Homo sapiens haol gene, complete CDS | 


Homo sapiens derivative 1 1 breakpoint fragment: partial Intron 1 0 of the ALL-1 /MLUHRX gene fused to intron 1 
5oflheAF-4/FELgene 


Schistocerca gregaria alpha repetitive DNA | 


qh57e09.x1 SDares_fetalJi>'er_splBen_1NFLS_S1 Homo sapiens cDNA clone IMAGE:1 848808 3' similar to 1 
contains OFR.bl OFR repetitive element ; | 


Zea mays calcium-dependent protein l<inase (MZECDPK2) mRNA, complete cds | 


Zea mays starch branching enzyme lib (ae) gene, complete cds | 


ne13a02.s1 NCI CGAP Co3 Homo sapiens oDNA clone IMAGE:881066 3' similar to gb:M17886 60S 1 
ACIDIC RIBOSOMAL PROTEIN PI (HUMAN); | 


ne13a02.s1 NCI CGAP Co3 Homo sapiens cDNA clone 1MAGE:881066 3' similar to gb:M17886 60S 1 
ACIDIC RIBOSOMAL PROTEIN PI (HUMAN); | 


ys02g06.s1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA done IMAGE:213658 3' | 


Mesccricetus auratus oviductin precursor (OVI) gene, complete cds | 


Top Hit 
Database 


EST HUMAN | 




EST HUMAN | 




SWISSPROT 1 


NT 1 




NT 1 






EST HUMAN | 






NT 1 


NT 


NT 1 


NT 




EST HUMAN 




NT 1 


Iest human 1 


EST HUMAN | 


EST_HUMAN 1 




TopHitAcession 
No. 


R24494.1 1 


Y11114.1 1 


BE385164.1 | 




P35616 1 




AF081810.1 1 


AF255051.1 1 


i 


AF000716.1 


AA336909.1 | 


AJ238736.1 1 


1 
< 


AF081514.1 1 


i 


AJ224877.1 | 


AJ235377.1 


X52936.1 1 


AI247635.1 


U28376.1 1 


AF072725.1 | 


i 
i 


i 
1 


H72118.1 1 


AF026552.3 | 


Most Similar 
(Top) Hit 
BLAST E 
Value 


8E-01 


8E-01 


7E-01 


7E-01 


7E-01 


7E-01 


7E-01 


7E-01 


1.7E-01 


7E-01 


7E-01 


7E-01 


7E-01 


7E-01 


7E-01 


7E-01 


7E-01 


7E-01 


7E-01 


7E-01 


S 


3 


7E-01 


7E-01 


7E-01 


















































•a -o 




^ 
















1.93 


1.741 


1.35| 


1.351 




^ 




4.41 


















ORFSEQ 
ID NO; 














1 




s 
Si 


22549 


1 




i 


227851 


23119 


^ 


23562 




24403 






25046 


25047 


258391 




Exon 
SEQID 
NO: 


192721 


19294] 


105031 


10717| 


1 


1 


§ 




s 


12758 




1 


1 






13471 1 


13770 


14353 1 


I46I7I 


1 


1 


15241 


15241 


157261 




Probe 
SEQID 
NO: 


97381 


i 






i 


1042 


10421 


1 


i 




i 


1 


2967 


1 


1 


35571 


1 


1 


4732 


1 


5067 


.J 


5321 


5820 


6112 
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Top Hit Descriptor 


|Homo sapiens apelin gene, complete cds | 


1 EST380677 MAGE resequences, MAGJ Homo sapiens cDNA 1 


1 Mus musculus chaperonin subunit 3 (gamma) (Cct3), mRNA 1 


S.cerevisiaechromDsomeXI reading frame ORF YKRIOSc 


IS.cerevisiaechromosomeXI reading frame ORF YKR1 05c | 


|2l84h09.s1 Stratagene colon (#937204) Homo sapiens cDNA clone IMAGE:511361 3' similar to TR:E221955 1 
E221 955 38:855 BP SEGMENT OF CHROMOSOME XIV. ; 


Lyoopersicon esculentum Rsal fragment 2, satellite region | 


[ Lycopersicon esculentum Rsal fragment 2, satellitB region ( 


bb83h08.y1 NIH_MGC 10 Homo sapiens cDNA clone IMAGE:3049023 5' similar to gb:M61 71 5 1 
TRYPTOPHANYL-TRNA SYNTHETASE (HUMAN); gb:X69657 M.musculus (MOUSE); | 


xm43f01 xl NCi_CGAP_GC6 Homo sapiens cDNA clone IMAGE:2686969 3' similar to TR:075984 075984 1 

HYPOTHETICAL 127.6 KD PROTEIN ; 


Ixni43f01.x1 NCI CGAP GC5 Homo sapiens cDNA clone IMAGE:2686969 3' similar to TR:075984 075984 1 
HYPOTHETICAL 127.6 KD PROTEIN; | 




! Arabidopsis ttiaiiana DNA chromosome 4, contig fragment No. 84 1 


1 Arabidopsis thaliana DNA chromosome 4, contig fragment No. 84 | 


|UI-H-B12-agi-b-06-0-Ul.s1 NCI_CGAP_Sub4 Homo sapiens oDNA clone IMAGE:2724418 3' | 


1 2822248.5prime NIH_MGC_7 Homo sapiens cDNA clone IMAGE:2B22248 5' | 


Gorilla gorilla androgen receptor gene, partial exon 1 




RC3-ST0200-04119g-011-hD1 ST0200 Homo sapiens cDNA | 


S.cerevlsiae chromosomeX readingframe ORF YJROOIw | 


1 1L3-CT0220-1 1 1 199-028-G01 CT0220 Homo sapiens cDNA 1 


CHROMODOMAIN-HELICASE-DNA-BINDING PROTEIN 2 (CHD-2) 


1 CHROMODOMAIN-HELICASE-DNA-BINDING PROTEIN 2 (CHD-2) 




Mus musculus adaptor-related protein complex AP-1, beta 1 subunit (Aplbl), mRNA | 


IQV2-PT0010-160400-133-a08 PT0010 Homo sapiens cDNA | 




|AV719585 GLC Homo sapiens cDNA clone GLCEMF07 5' | 


Cucumis safivus KS mRNA for ent-kaurene synthase, complete cds 1 


iHomo sapiens mRNA for FLJ00104 protein, partial cds | 


Top Hit 


Source 


iNT 1 


IEST HUMAN 1 




Int 1 




z 
1 


iNT 1 


iNT . 1 


Z 
1 

I 


EST_HUMAN 


EST HUMAN | 


NT J 


NT 


Int 1 


|EST HUMAN | 


i 


NT_ 1 


iNT 1 


[EST HUMAN | 


NT 


1est_human 1 


O 

a: 
1 


ISWISSPROT 1 




NT 


lEST HUMAN | 


iNT 1 


|EST HUMAN | 




Int 1 


TopHitAcession 




i 

< 




i 


IZ28330.1 1 


|Z28330.1 1 


1 


IAJ006356.1 1 


|AJ006356.1 1 


1bE018707.1 1 


AW 197496.1 


|aW197496.1 | 


|AF034716.1 1 


lALI 61 588.2 1 


|AL1 51 588.2 1 


1 


1 


|L49349,1 1 


IZ49501.1 1 


IBF375171.1 1 


IZ49501.1 1 


1 


|014647 1 


1 

5 


|AF1 06064.1 1 


1 6671552] 


IAW877127.1 1 


1 


|AV719685.1 1 


1 


|AK024496.1 | 


Most Similar 
(Top) Hit 


BLAST E 
Value 


.6E-01 


3 


.5E-01 


.5E-01 


s 


9 


9 


9 


.6E-01 


.6E-01 


9 


9 


.6E-01 




9 


.6E-01 


9 


.6E-01 


§ 


9 




i 


W-39- 




9 






.5E-01 


i 


.5E-01 






























































1 

1 






i 




« 


1 




\ 1-92| 


1 •'■92| 


! 






2.07| 










_ 1.42 


! 




1.911 










1 10.071 






2.33^ 


1 6.331 




ORF SEQ 


ID NO: 


i 






1 


i 






1 245471 


245Cl| 


8 




1 253641 


1 


i 








272541 




276311 










1 288141 






28795 






Exon 
SEQ ID 
NO: 


141321 


14249 




1 


1 


14745 


1 


g 


i 


i 


15309 


15317 


15779 






1 


16472 


17064 


17415 


17416 


1 


i 


18241 


18332 


1 






1 




1 






























































Probe 
SEQ ID 
NO: 


42341 


9 


4361 


1 


1 


1 


1 




1 


8 


s 


i 






6157 


5571 




7187 


1 


1 


1 


S3 


s 


1 


1 87131 


i 




5 


1 95651 
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Top Hit Descriptor 


|6O2083269F1 NIH_MGC_81 Homo sapiens cDNA done IMAGE:4247537 5' | 


|CMO-HT0565-280200-245-b10 HT0565 Homo sapiens cDNA | 


1 
I 
£ 

i 
1 

? 

3 

1 




[Arabidopsis thaliana AP2 domain containing protein RAP2.12 mRNA, partial cds | 


[THROMBOSPONDIN 1 PRECURSOR | 


[SEX HORMONE-BINDING GLOBULIN PRECURSOR (SHBG) (SEX STEROID-BINDING PROTEIN) 1 
l(SBP) (TESTIS-SPECIFIC ANDROGEN-BINDING PROTEIN) (ABP) | 


[IL3-CT0219-160200-064-F10 CT0219 Homo sapiens cDNA | 






ii 

E 

Si 

E 

8 c 

I- 
i 

f ^ 

E 

il 
1- 

i 
s 

i i 




i 

i 
1 

< 

1 

s 
§ 


< 

i 
s 

1 

s 

i 

i 

i 


Influenza 8 virus (B/Nanchang/480/94) NB protein gene, complete cds; and neuraminidase gene, partial cds | 


1 Archaeoglobus fulgidus section 68 of 1 72 of the complete genome | 




IGLUTAMATE-CYSTEINE LIGASE REGULATORY SUBUNIT (GAMMA-GLUTAMYLCYSTEINE 1 


> 
t 

1 

i 

1 


z 

i 


IHYPOTHETICAL 51.7 KD PROTEIN IN THRC-TALB INTERGENIC REGION (ORF8) ] 


|EST382376 MAGE resequences, HMGK Homo sapiens cDNA ] 


[Homo sapiens HARP (HARP) gene, exon 17 and complete cds | 


|wr52c08.x1 NCLCGAP.UtI Homo sapiens cDNA clone 1MAGE:2491310 3' ] 


IUI-HF-BNO-akk-d-05-0-Ul.r1 NIH_MGC_50 Homo sapiens cDNA clone IMAGE:3077409 5' ] 


IUI-HF-BNO-akk-d-05-0-Ul.r1 NIH_MGC_50 Homo sapiens cDNA clone IMAGE:3077409 5' | 


|oo85g12.s1 NCI CGAP Kid5 Homo sapiens cDNA clone IMAGE:1573030 3' similar to gb:M26062 1 
llNTERLEUKlN-2 RECEPTOR BETA CHAIN PRECURSOR (HUMAN); | 


|C1 6800 Clontech human aorta polyA+ mRNA (#6572) Homo sapiens cDNA clone GEN-529H09 5' | 


iPangasianodon gigas growth hormone (GH) mRNA, complete cds | 


Homo sapiens mRNA for ASK1 , complete cds | 


Top Hit 
Database 
Source 




|EST HUMAN | 


lEST HUMAN 1 


z 


z 


[SWISSPROT 1 


SWISSPROT 1 


[EST HUMAN | 


z 


z 


z : 


E z 




z 


z 




z 




[SWISSPROT 1 


[SWISSPROT 1 


[SWISSPROT 1 






[EST HUMAN | 


[EST HUMAN | 


[EST HUMAN | 


EST HUMAN | 


[EST HUMAN | 






TopHitAcession 
Na 


1 

8 


IBE173796.1 1 


IBE1 73796.1 1 


|AL1 61 560.2 1 








[AW850754.1 | 


[U65016.1 1 


i 


i i 


|aJ276505.1 | 


i 


g 


AF134907.1 1 


[AE001 039.1 1 


] 11417236] 




§ 
1 


1 

a 


[P30143 ] 


[AW970295.1 ] 


[AF21084Z1 ] 


[AI973157.1 ] 


[AW5Q0611.1 ] 


[AW5Q0611.1 ] 


AA970317.1 


1 

5 


[L27835.1 ] 


Q 


Most Similar 
(Top) Hit 
BLAST E 
Value 


5 


.6E-01 


.5E-01 


.5E-01 


i 


1 




i 


.5E-01 


1.0-39 ■ 


.5E-01 


.5E-01 




5 


.5E-01 


.5E-01 


.5E-01 




.5E-01 


.5E-01 


.5E-01 


9 


9 


9 


s 


S 


.5E-01 


.5E-01 


.5E-01 




























































Expression 
Signal 






































§ 




i 


P: 










is 


s 




ORFSEQ 
ID NO: 


1 22435] 


1 243611 


1 243621 




1 24716| 






1 253671 




1 






25774 




1 


[ 25948] 


1 25963] 


25971 


[ 260001 


1 260671 


1 






1 26507] 












.. 


I2544| 


§ 


S 




14942 




15188 


s 


Si 










i 




i 




s 


s 


15935 




15148 


i 


s 




i 


16774 


i 






























































Probe 
SEQ ID 
NO: 




1 


1 


1 


5072 


1 


1 


i 


5424 


5424 


M 


11 




i 


1 


i 


1 


i 


i 


i 




1 


6374 


i 


i 


1 


1 


i 


i 
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Top Hit Descriptor 




M.musculus p"6K gene for 1 6 l<Da protein | 


EphydaSa fluviatilis mRNA for aldolase, partial cds | 


1 
1 

I 


Rattus norvegicus desmin (Des). mRNA | 


301 31 5638F1 NIH MGC 8 Homo sapiens cDNA clone 1MAGE:3634329 5' | 


transformylase (GART) genes, complete cds | 




TYROSINE-PROTEIN KINASE TRANSFORMING PROTEIN ABL | 


501465575F1 N1H_MGC_67 Homo sapiens cDNA clone 1MAGE;3868795 5' | 


% 

s 

5 

1 
1 


Mus musculus mRNA for prolidase, complete cds | 


< 

j 

1 

1 
1 

1 


Homo sapiens G protein-coupied receptor 50 (GPR50) mRNA | 


Homo sapiens G protein-coupled receptor 50 (GPR50) mRNA | 


Homo sapiens gene for NBS1, complete cds | 


Human calicivirus HU/NLV/Girlington/93/UK RNA for capsid protein (ORF2), strain HU/NLV/Girlington/93/UK| 


Human cdicivirus HU/NLV/Glrlington/93/UK RNA for capsid protein (0RF2), strain HU/NLV/GirIington/93/UK| 


P.dumerilii histone gene cluster for core liistones H2A, H2B, H3 and H4 | 


Rattus norvegicus A-kinase anclior protein mRNA, complete cds | 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen deprivation | 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen deprivation | 


AV71 2467 DCA Homo sapiens cDNA clone DCAAFF05 5' | 


Homo sapiens adapter protein CMS mRNA, complete cds | 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen deprivation | 


genes and 0RF1 51 | 


RC4-ST0173-191099-032-d12 ST0173 Homo sapiens oDNA | 


Arcinaeoglobus fulgidus section 91 of 1 72 of tlie complete genome | 


Carassius auratus keratin type 1 mRNA, complete cds | 


Top Hit 
Dataijase 
Source 




NT 1 


NT 1 


NT 1 


NT 1 


EST HUMAN | 


NT 1 


NT 1 


SWISSPROT 1 


EST HUMAN 1 


NT 1 


NT 1 


EST HUMAN 1 


NT 1 


NT 1 


NT 1 




NT 1 


IN 


NT 1 


NT 1 


NT 1 




NT 


NT 1 


NT 


EST HUIVIAN 1 


NT 


NT 1 


Top HitAcession 
No. 


|u28760.1 1 


X52102.1 1 


1 
1 




i 


1 




D64004.1 1 


PI 0447 1 


BE782738.1 | 


1 


Q 


AW377998.1 1 


47584671 




:AB013139.1 | 


i 

s 


1 
Em 

2 


X53330.1 1 


AF1 3951 8.1 1 


5 


ALII 5265.1 1 


AV71 2467.1 1 


:AF1 46277.1 1 


|AL1 17078.1 1 


|aJ243578.1 | 


|AW81 2104.1 1 


1 




Most Similar 
(Top) Hit 
BU^STE 
Value 


.4E-01 


.4E-01 


3 


S 


s 


!^ 


.4E-01 


.4E-01 


.4E-01 


.4E-01 


.4E-01 


.4E-01 


.4E-01 


.3E-01 


.3E-01 


.3E-01 




.3E-01 


.3E-01 


.3E-01 


.3E-01 


.3E-01 


.3E-01 


.3E-01 


.3E-01 


.3E-01 


.3E-01 


s 


.3E-01 




























































Expression 
Signal 


2.23 


3.021 


1.48| 








3.01 1 




3.28| 








1.77| 


§ 








0.89 1 






s 


cvi 








J 








ORF SEQ 
ID NO: 


28158 




251 38 1 
























§ 


20271 1 


i 


20370 1 




206521 










1 








i 


Exon 
SEQ ID 
NO; 


17913 




9517 




i 








9754 




1 


1 


1 


1 


0281 


1 


10557 


10557 




I 


1 






1331 




12012 


2129| 
































































Probe 
SEQ ID 
NO; 


8764 


s 




i 


9427| 


1 




1 






9831 




1 








s 


g 
















1 2124 


22451 




2542| 
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S 
is 

1 




1 












TR:Q15571 1 


3N FACTOR | 












4048 Q14048 
.repetitive 




3 




















i 
















3 














of 




s 
























fe 












1 
m 


ANSCRI 




2734554 








lilartoTF 
slement F 




1 












in 

s 


8 






o 

i 


to 


E 
1 












lg 


rCELLTR 


i 


,e IMAGE: 








1 .8 




i 




Top Hit Descriptor 


|MR4-BT0358-130700-010-il08 BT0358 Homo sapiens cDNA 


[Homo sapiens dopamine transporter (SLC6A3) gene, complete cds 


|Mus musculus cofilin 2, muscle (Cfl2), mRNA 


|60115Sa52F1 NIH_MGC 21 Homo sapiens cDNA clone IMAGE:35041 


1 601 462741 F1 NlH MGO 67 Homo sapiens cDNA clone IMAGE:3866( 


[Gallus gallus scycl gene for lymphotacfin, exons 1-3 


lEphydatia fiu'/iatilis mRNA for sALX-6, complete cds 


TR:O60287 060287 KIAA0539 PROTEIN. ; 


^02078440F1 NIH_MGC_62 Homo sapiens cDNA clone 1MAGE:4253( 


tf39b02.x1 NCLCGAP_Brn23 Homo sapiens cDNA clone IMAGE:209I 
ANNEXIN V (HUMAN); 


1 Dictyos'.elium discoideum ORF DG1016 gene, partial cds 


Homo sapiens colon cancer antigen NY-CO-45 mRNA, partial cds 


|AU1491 46 NT2RM4 Homo sapiens cDNA clone NT2RM4001691 3' 


AU149146 NT2RM4 Homo sapiens cDNA clone NT2RM4001 691 3' 


1 

1 
1 

E 
X 

? 
1 


al48e09.s1 Soares NFL T GBC SI Homo sapiens cDNA clone IMAC 
Q1S671 ANTI-MULLERiAN HORMONE TYPE II RECEPTOR PRECl 


NUCLEAR FACTOR OF ACTIVATED T-CELLS, CYTOPLASMIC 4 (1 
NFAT3) (NF-ATC4) (NF-AT3) 


qt59f09.x1 NCI CGAP Eso2 Homo sapiens cDNA clone 1MAGE:1960 


i 

1 

1 

i 
0 

■•i 


i 

a 

§ 
< 

1 

i 

i, 
i 




|QV3-BN0046-220300-129-f10 BN0046 Homo sapiens cDNA 


ts18g07.x1 NCLCGAP_Pan1 Homo sapiens cDNA clone 1MAGE:222£ 
COLLAGEN VI ALPHA-2 ALTERNATIVE C-TERMiNAL DOMAIN. [1] 
element ; 


jHuman E1A enhancer binding protein (E1A-F) mRNA, partial cds 


as80c09.x1 Barstead colon HPLRB7 Homo sapiens cDNA clone IMAG 
60S RIBOSOMAL PROTEIN L30 (HUMAN); 




Top Hit 
Database 
Source 


HUMAN 






■ HUMAN 1 


HUMAN 1 






HUMAN 


HUMAN 1 


HUMAN 






HUMAN J 


_HUMAN 1 


HUMAN 1 


_HUMAN 


O 
IT 


HUMAN 


HUMAN 


HUMAN 1 




HUMAN 1 


HUMAN 




HUMAN 1 






















[a 












[S 


[S 


i 


to b 


















TopHitAcession 
No. 


i 


< 


i 




1 


IAJ242790.1 1 


i 
§ 
< 


AW001114.1 


1BF571 764.1 j 


A1421744.1 


i 


1AF039442.1 | 


|AU149146.1 1 




IAV735249.1 | 


§ 


1 

a 


S f 
< > 


4 
5 1 

II 


i 


i 
< 


1 
< 


1 
< 


IU18018.1 1 


AI720470.1 1 


M16364.1 1 


Similar 

3TE 

lue 


s 


s 


s 


s 


s 


S 


s 


s 


s 




i 




«? 


s 




9 


i 


1.2E-0lj 


? 9 




9 




9 


i 


i 


1 1.2E-01I 


MostJ 

BLA! 

Val 










































Expression 
Signal 




s 










s 










s 












S 1 










1 






§ 


a 

to o 




i 




i 


i 
















211171 








21377I 


21396 




21919 




1 






22594 


22628 


K Q 

o 






















































Exon 
SEQ ID 
NO: 


a 






i 


i 


I 19054| 




i 




i 


i 


10476 






1 






i I 




5^ 


12124 








§ 


i 


lis 


i 






8873 


i 




9757 


i 


i 








1355 






1 




i 1 


"li^T^ 


2134 




2546 


2697 


2812 


2872 
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Top Hit Descriptor 




QV1-BT0259-261099-021-d05 BT0259 Homo sapiens cDNA | 


Methanococcus jannasciiii section 142 of 150 of the complete genome \ 


Bacillus subtilis complete genome (section 15 of 21): from 2795131 to 3013540 | 


W heat mRNA for a group 3 late embryogenesis abundant protein (LEA) | 


Wheat mRNA for a group 3 late embryogenesis abundant protein (LEA) | 


Bacillus subtilis complete genome (section 15 of 21): from 2795131 to 3013540 | 


601 81 0786R1 N1H_MGC_46 Homo sapiens cDNA clone IMAGE:4053668 3' | 


p.clarkii mRNA; repeat region (ID 2MRT7) | 


p.clarkii mRNA; repeat region (ID 2MRT7) | 


HEMOLYSIN PRECURSOR | 


HYPOTHETICAL 52.4 KD PROTEIN C12B10.08C IN CHROMOSOME 1 | 


HYPOTHETICAL 52.4 KD PROTEIN C12B10.08C IN CHROMOSOME 1 | 


UI-HF-BKO-aah-d-01-Q-Ul.rl NIH MGC 36 Homo sapiens cDNA clone IMAGE:3053617 5' | 


i 

S 
1 

1 
< 

1 
t 

i 
1 

1 

! 


Homo sapiens gene encoding plakophilin (exons 1-13) 1 


601 49351 8F1 N1H_MGC_70 Homo sapiens cDNA clone IMAGE;389561 3 5' | 


1LO-CTQ031-221099-1 13-604 CT0031 Homo sapiens cDNA | 


1 
1 

1 
1 

1 
1 


1 
I 

i 
1 


woS9g03.x1 NCI CGAP Co3 Homo sapiens cDNA clone IMAGE:2326804 3' similar to SW:GST2 HUMAN ] 
Q99735 MICROSOMAL GLUTATHIONE S-JRANSFERASE 11 ; 


xc49d07.x1 NCI CGAP Eso2 Homo sapiens cDNA clone 1MAGE:2587597 3' similar to gb:M13452 U\MIN A| 
(HUMAN); 




Haemophilus influenzae Rd section 29 of 1 63 of the complete genome 


S.cerevisae HXT5 gene | 


AV710857 Cu Homo sapiens cDNA clone CuAAKEOS 5' | 


Yeast MPT5 gene for suppressor protein, complete cds | 


i 

1 
< 

1 

1 

I 

o 

S| 
2 

§ 


601900763F1 N1H_MGC_19 Homo sapiens cDNA clone IMAGE:4130103 5' ] 


Homo sapiens dynein intermediate chain DNA11 (DNAI1) gene, exon 17 ] 


Top Hit 
Database 
Source 




EST HUMAN | 


Z 


2 








EST_HUMAN | 


z 




SWISSPROT 1 


SWISSPROT 1 


SWISSPROT 1 


EST HUMAN | 


EST HUMAN 




EST_HUMAN 1 


EST HUMAN | 




EST HUMAN ] 


EST HUMAN 


EST HUMAN 








EST HUMAN ] 




EST HUMAN ] 






TopHitAcession 
No. 




< 


U57600.1 1 


i 


X56882.1 1 


i 


i 






Z54255.1 1 




5 


Q10441 1 


AW401835.1 1 






BES20945-1 1 


AW845275.1 | 


M26925.1 1 


BE0D7Q72.1 | 


AI91 3753.1 


AW083652.1 


AF053772.1 


U32714.1 1 


ft 


1 


□261 84.1 1 




BF31 4481.1 1 


AF1 90493.1 1 


IVIost Similar 
(Top) Hit 
BLAST E 
Value 


.2E-01| 


.2E-01 


.2E-01 


.2E-0T1 


.2E-01 


.2E-01| 


.2E-0l| 


.2E-0l| 


.2E-01| 


.2E-01| 


.2E-01 1 


.2E-01| 


.2E-01I 


1 


.2E-01 


.2E-01I 


.2E-01I 


.2E-01| 


.2E-0l| 


.2E-01 ] 


.2E-01 


S 




.2E-01 


3 


.2E-01| 


S 


.2E-01] 


.2E-01 


.2E-01I 






























































P 






















s 












i 


2.19| 






2.46| 














3.35] 






ORFSEQ 
ID NO: 




22923 1 






g 










23767| 




248071 


24808] 






250211 




25824 


25858] 












27546] 










28715] 


Exon 
SEQID 
NO: 


i 




3143 


S 


3393 


i 


S 










? 

s 


5040 


i 






1 




5745 




16522 




16782 


§ 


7340 


s 




i 




8447| 






























































Probe 
SEQID 
NO: 


1 29741 


! 31921 


1 


1 


34771 


3477 


3554 


s 






5012 


5174 


5174| 


g 


5251 


1 


5762 


§ 


1 


i 


1 


1 


i 


70431 


I 7521 






8417 


1 


8579| 
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85-RON) 1 


















i' 




































Top Hit Descriptor 


-S 
■5. 
E 

8 
< 

1 

5 

1 
1 

1 
1 

i 
1 

i 


IAV658033 GLC Homo sapiens oDNA clone GLCFIB12 3' 




IMACROPHAGE-STIMULATING PROTEIN RECEPTOR PRECURSOR (MSP RECEPTOR) (PI 
|(CDW136) (CD136 ANTIGEN) 


Homo sapiens colon cancer antigen NY-CO-45 mRNA, partial cds 




IQV4-BT0234-11 1199-031-glO BT0234 Homo sapiens cDNA 


|qn20g05.x1 NCLCGAP_Lu5 Homo sapiens cDNA clone IMAGE:1898840 3' 


|XenopuE laovis integrin alpha 3 subunit mRNA, partial cds 


1 CYCLIN T 


601900763F1 N1H_MGC_19 Homo sapiens cDNA clone 1MAGE:4130103 5' 


1 
< 

i 
< 

i 

f 

i 

i 

a} 
8 

s 

1 

i 


1 

i 
1 

1 

1 
g 

1 

E 

i| 

11 

o 

§ s 


i 

D 
1 
t 

i 

% 
i 


ArabidopsiE thaliana DNA chromosome 4, oontig fragment No. 60 




ISynechocystis sp. PCC6803 coTiplete genome, 23/27, 2868767-3002965 


|AU140363 PU^CE2 Homo sapiens cDNA clone PU\CE20004O3 5' 


Mus musculus pre T-cell antigen receptor alpha (Ptcra). mRNA 


iRattus norvegicus Procollagen 11 alpha 1 (Ccl2a1), mRNA 


|RC0-ST0379-210100-032-g04 ST0379 Homo sapiens cDNA 




|HSC1RF022 normalized infant brain cDNA Homo sapiens cDNA clone c-1 rf02 3' 




|601308679F1 NIH MGC 44 Homo sapiens cDNA clone IMAGE:3527066 5' 


1 C.relnhardlll nuclear gene on linkage group XIX 


A.immersus gene for transposase 




|MR3-ST0290-290100-025-g07 ST0290 Homo sapiens cDNA ' 


|MR3-ST0290-290100-026-g07 ST0290 Homo sapiens cDNA 


Top Hit 
Database 


2 




z 


SWISSPROT 1 


1- 
z 


z 


|est_human 1 


|EST HUMAN | 


z 


ISWISSPROT 1 


[EST HUMAN | 


|EST HUMAN | 


i 


|EST HUMAN 1 


z 


[EST HUMAN | 


z 


|EST HUMAN | 


z 




[EST HUMAN | 


z 


|EST HUMAN | 




|EST HUMAN | 


z 




z 


|EST HUMAN j 


jEST HUMAN | 


TopHHAcession 
No. 


IM65109.1 1 


i 


iAJ271 736.1 1 


Q04912 1 


1 


1X53981.1 1 


|BE051418.1 1 




IL10187.1 1 




IBF31 4481.1 1 


1 1.E001.99IVI 




1 


1AL16155C.2 1 


IAW972158.1 1 


|D64004.1 1 


!AU1 40363.1 1 




1 


[AW821909.1 1 


IS82418.1 1 


IF03265.1 1 


6753231 1 


i 
1 


1X62135.1 1 


i 


1X52708.1 1 


IAW819412.1 1 


|AW81 9412.1 1 


Similar 
i)Hit 
3TE 
lue 


.2E-01 


.2E-01 


.2E-01 


.2E-01 


.2E-01 


.2E-01 


9 


s 


s 


lis 


3 


9 


s 


9 


.1E-01 


s 


9 


s 


9 


.1E-01 


s 


9 


s 


9 


3 


9 


9 




3 


9 
































































ll 
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1 2.02i 




1 2.78| 


S 










1 2.071 




1 2.181 








































ORFSEQ 
ID NO: 








2489?! 






1 


1 252311 








203011 


20344| 


207991 




i 




i 








225451 


1 227171 




1 










i 




18612 




s 


1 




1 


1 


1 


1 


i 


19424 


10493 


10535 


i 


i 


i 






g 




1 


12754 




13221 


1 




13450 
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s 
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s 
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Top Hit Descriptor 


yg98f07.r1 Scares infant brain 1NIB Homo sapiens cDNA clone IMAGE:41618 6' 1 


MAJOR EPIDIDYMIS-SPECiFIC PROTEIN E4 (EPIDIDYMAL PROTEIN BE-20) 


nf79e01.s1 NCI_CGAP_Co3 Homo sapiens cDNA clone IMAGE:926136 3' 1 


Mua musculus pre T-cell antigen receptor alpha (Ptcra), mRNA 


Human herpesvirus 1 strain KOS-63, latency-associatecl transcript, promoter region 1 


600944365F1 N1H_MGC_17 Homo sapiens cDNA clone IMAGE:29601 76 5' 1 


1 

i 
1 

d 


ya99c09.r1 Stratagene placenta (^37225) Homo sapiens cDNA clone IMAGE:69808 5' similar to similar to 
gb:X56009 GUANINE NUCLEOTIDE-BINDING PROTEIN G(S), ALPHA SUBUNIT (HUMAN) | 


H.vulga-e xylose Isomerase gene | 


0. cuniculus Wl 2 keratin gene | 


6-PHOSPHOFRUCTOKINASE (PHOSPHOFRUCTOKINASE) (PHOSPHOHEXOKINASE) | 


PM2-BT0349-1 61299-001-f02 BT0349 Homo sapiens cDNA | 


Arabidopsis thaiiana DNA chromosome 4, contig fragment No. 54 1 


Homo s^ens MSH55 gene, partial cds; and CLIC1, DDAH, G6b, G6c, G5b, G6d, G6e, G6f, BATS, G6b, 
CSK2B, BAT4, G4, Apo M, BATS, BAT2, AIF-1 , 1 C7, LST-1 , LTB, TNF, and LTA genes, complete cds 


au74a05.y1 Schneider fetal brain 00004 Homo sapiens cDNA clone IMAGE;2781968 5' | 


FB19F10 Fetal brain, Stratagene Homo sapiens cDNA clone FB19F10 3'end [ 


Bacteriophage Mu, complete genome | 


zp38h12.5l Stratagene muscle 937209 Homo sapiens cDNA clone IMAGE:61 1783 3' similar to 1 
SW:TRT3_HUMAN P45378 TROPONIN T, FAST SKELETAL MUSCLE, ISOFORM BETA ; 


Homo sapiens partial MUC3B gene for MUC3B mucin, axons 1-11 1 


FOLATE RECEPTOR ALPHA PRECURSOR (FR-ALPHA) (FOLATE RECEPTOR 1) (FOLATE 
RECEPTOR. ADULT) (ADULT FOLATE-BlNDING PROTEIN) (FBP) (OVARIAN TUMOR-ASSOCIATED 
ANTIGEN MOV18) (KB CELLS FBP) 


hv39g1 0.x1 NCI_CGAP_Lu24 Homo sapiens cDNA clone IMAGE:3175842 3' similar to contains AIu | 


H IV-1 p8cD95-36 from USA envelope glycoprotein (env) gene, partial cds | 


H IV-1 pBc095-06 from USA envelope glycoprotein (env) gene, partial cds 1 


Dictyostelium discoideum spore coat structural protein SP65 (cotE) gene, complete cds 1 


cortioosteroid-binding globulin [Saimiri sclureus=squirrel monkeys, liver, mRNA, 1474 nt] 1 
LAMININ BETA-2 CHAIN PRECURSOR (S-LAMININ) 1 


III 


UMAN 1 


PROT 1 


i 






z 

1 










0 


UMAN 1 








Z 
1 








PROT 


Z 
1 

3 








PROT 1 




EST h 


SWISS 


EST. h 


NT 


NT 


EST 1- 


NT 


EST. ,h 


NT 


NT 


331 MS 1 


EST 1- 


NT 




ESTh 


ESTh 


NT 




NT 


3SIMS 


EST H 


NT 


NT 






i 

1.. 
























i 










i 












AF138522.1 1 






Top Hit A 
Nc 


S 

S 
a: 


i 

a 


AA53435^ 




§ 
D 


§ 

CQ 


1 




i 

1 


i 


1 


1 


S 
5 




i 


i 




AA1 79901 


AJ291390 


PI 5328 




AF1 38522 




il 


milar 
Hit 






S 


S 


E-02 


E-O2I 


E-02 


E-02 


E-02 


E-02 




E-02 


E-02 


E-02 




E-02 




E-02 


E-02 


g 


E-02 


E-02 


E-02 


E-02 
£-02 


:-02j 






















































g 

II 

r 




































I.52I 






i 




1.11J 






ORF SEQ 
ID NO: 






i 








2421 3 1 






i 






1 


25487 














21379I 


22528 1 




ii 




Exon 
SEQ ID 
NO: 






1 




i 


§ 


1 


0999. 


i 


i 


i 


13532| 


142791 




s 










S 
g 


1152l| 




126341 




5 g 


Probe 
SEQ ID 
NO: 


1 2180 


1 3142 


S 


1 3537, 


4145 


4211 


4538 


1 


I 




s 

8 




4383 


i 


6372 


i 


9021 




i 




I6I7I 


2772 


27721 


32941 
4202 


42021 
4321 1 
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Top Hit Descriptor 


1 Plasmodium falciparum P-type ATPase 3 gene | 


o 

§ 


za58a12.r1 SoareE_fetalJung_NbHL19W Homo sapiens cDNA clone 1MAGE:297694 5' similar to 1 
PIR:S521 71 S521 71 small G protein - human ; | 


J Homo sapiens chromosome 16 open reading frame 5 (C16orf5), mRNA | 


1 5021 29030F2 NIH_MGO_56 Homo sapiens cDNA clone 1MAGE:4285951 5' | 


|502129030F2 NIH_MGC_56 Homo sapiens cDNA clone IMAGE:4285951 5' | 


1 PM0-HT0339-251 1 99-003-d01 HT0339 Homo sapiens cDNA | 


[Africhum angusfatum AtranRo2 protein (AtranFlo2) gene, partial cds | 


i 

1 

i 
1 

z 

S 
t 
i 

1' 

1 

1 


i 
i 

1 

1 

Q 

■1 

s 
i 

£ 

1 

0 

s 

2 

A 

f 


UI-H-BI3-alo-f-08-0-UI.El NCi_CGAP_Sub5 Homo sapiens cDNA clone IMAGE:30e8294 3' 1 


1 
§ 

8 

? 

1 

1 

i 


FOLD BIFUNCTIONAL PROTEIN [INCLUDES: METHYLENETETRAHYDROFOLATE 1 
DEHYDROGENASE ; METHENYLTETRAHYDROFOLATE CYCLOHYDROLASE ] | 


1 H. sapiens flow-sorted chromosome 6 Hindlll fragment, SC6pA20F8 | 


1 EST1 80187 Liver, hepatocellular carcinoma Homo sapiens cDNA 5' end | 


502129682F1 NIH MGC 56 Homo sapiens cDNA clone IMAGE:42861 80 5' | 


PROBABLE DNA LIGASE d^-OLYDEOXYRIBONUCLEOTIDE SYNTHASE [ATP]) 


1 EST11 595 Uterus Homo sapiens cDNA 5' end | 


TRANSCRIPTION INITIATION FACTOR TFIID 135 KDA SUBUNIT (TAFII-135) (TAFII135) (TAFII-130) 1 
|(TAFI1130) 1 


1 Homo sapiens paired box gene 6 (aniridia, keratitis) (PAX6), isoform b, mRNA 1 


1 zn99a05.s1 Stratagene colon (#937204) Homo sapiens cDNA clone IMAGE:566288 3' | 


1 601 1 91 770F1 NIH_MGC_7 Homo sapiens cDNA clone IMAGE:3535648 5' | 


I601191770F1NIH MGC 7HomosapienscDNAclonelMAGE:35356485' | 


!DKFZp434D1313_r1 434 (synonym: htes3) Homo sapiens cDNA clone DKFZp434D1313 5' | 




Homo sapiens zinc finger protein 92 (ZFP92), expressed-Xq28STS protein (XQ280RF), and biglycan (BGN) 
genes, complete cds; and plasma membrane calcium ATPase isoform 3 (PMCA3) gene, partial cds 


1 Homo sapiens zinc finger protein 92 (ZFP92), expressed-Xq28STS protein (XQ280RF), and biglycan (BGN) 
genes, complete cds; and plasma membrane calcium ATPase isoform 3 (PMCA3) gene, partial cds 


Top Hit 
Database 
Source 


NT J 


ISWISSPROT 1 


|eST HUMAN 


|NT ] 


|EST HUMAN ] 


]EST HUMAN ] 


lEST HUMAN 


iNT ] 


lEST HUMAN 


]EST HUMAN 1 


lEST HUMAN 1 


iNT ] 


IsWISSPROT 1 


NT J 


]EST HUMAN ] 


]est human ] 


]SWISSPROT ] 


]EST HUMAN ] 


SWISSPROT 1 


NT 


]EST HUMAN j 


]EST HUMAN J 


]EST HUMAN 1 


Jest human 1 


NT 


NT 


NT 


TopHitAcession 
No. 


i 


IQ24597 1 


W56037.1 1 


) 11431759] 


















P47259 1 


IZ79021.1 1 




§ 


1Q27474 ] 


i 


§ 
0 


1 4580423] 


|AA1 51 872.1 1 


I 


IBE264455.1 ] 


1 


g 


1 


U82695.2 


1 
1 


IBF701593.- 


]BE1 53572.- 




]AA424S87,' 


1 AW 4521 22 


)AW452122 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 9.0E-02I 


1 9.0E-02I 


1 9.OE-O2I 




1 8.9E-02I 


9 

s 


1 8.9E-02I 


] 8.9E-02I 




] 8.9E~02] 


] 8.9E-02] 


] 8.9E.02I 


1 8.9E-O2I 




] 8.9E-02I 


\ 8.9E-02I 


] 8.8E.O2I 


] 8.8E-021 


8.8E-02I 


1 8.8E-02I 


] 8.8E-O2I 


] 8.8E-O2I 


1 8.8E-O2I 


] 8.8E.02I 


] 8.8E-02I 


8.7E-02 


8.7E-02 


.i_ 

!> .5) 

r 


















s 












^ 




H 




1 3.24 


1.13 


s 






1 


























ORF SEQ 
ID NO; 


1 24248] 


1 247801 






g 


1 


1 22117] 




1 24208] 


255591 


§ 


s 


1 




i 






235321 






1 272501 


g 




1 28722] 


] 25332] 


23337 


23338 


Exon 
SEQ ID 
NO: 


s 


1 15009] 


i 


1 






1 12218] 


] 14004] 


1 14427] 


i 


1 


i 


I6135I 


t 163201 


16875 


1 




g 


13855 


14104 




1 18334] 


18334 


] 18453] 


1 19001] 


13550 


S 


Probe 
SEQID 
NO: 


i 


1 5142] 


1 5547 1 


& 


1419 


1419 


2338 


1 4104] 


4534 


i 


1 




1 6270 1 




i 


1 


1 13511 


1 38281 


3J 


4205 


7195 


s 


1 


8585 






i 
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Top Hit Descriptor 


zh58a02/1 Scares Jetaljver_spleen_1 N FLS_S1 Homo sapiens cDNA clone IMAGE:41 7104 5' similar to 1 
1 contains element MERl 2 repetitive element ; | 


Mus musculus JNK interacting protein-3a (Jip3) mRNA, complete eds | 


Methanobacteriumthermoautotropliicum from bases 1176181 to 1 1 89406 (section 101 of 148)ofthe j 


zs55g08.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:701438 3' | 


zs55g08.s1 NCLCGAP_GCB1 Homo sapiens cDNA clone IMAGE:701 438 3' | 


Orvctolagus cuniculus cytochrome P-450 (CYP4A4) gene, 5' end | 


Gluconobacter oxydans tRNA-lle and tRNA-Ala genes | 




Mus musculus nidogen 2 (Nld2). mRNA | 




S01304016F1 NIH_MGC_21 Homo sapiens cDNA clone IMAGE:3638643 5' | 




□ictyostdium discoideum adenyiyi cyclase (acrA) gene, complete cds | 


Oryctolagus cuniculus galecSn-3 gene, untranslated exon and 3 flanking region 1 


Homo sapiens LCNIb gene | 


Mouse germline IgM chain gene, D region; □-q52, mu switch region (part a) | 


Mouse germline IgM chain gene, D region; D-q52, mu switch region (part a) 1 


Homo sapiens Snf2-related CBP acfvator protein (SRCAP) mRNA | 


Homo sapiens Snf2^elated CBP activator protein (SRCAP) mRNA | 






501893437F1 NIH_MGC_17 Homo sapiens cDNA clone 1MAGE:4139216 5' | 


501893437F1 NIH MGC 17 Homo sapiens cDNA clone IMAGE:4139216 5' . | 


Archaeofilobus fulgidus section 34 of 172 of the complete genome | 




Helicobacter pylori 26595 section 130 of 134 of the complete genome 1 


M PROTEIN, SEROTYPE 6 PRECURSOR | 


"8 

1 
i 

E 
1 

3 

1 

8 
■& 
J 

i 
1 


Mus musculus myosin XV (Myol 5), mRNA | 


Top Hit 
Database 


EST HUMAN 






EST HUMAN | 


Z 

i 








z 




EST HUMAN 1 




z 


z 




z 




z 


z 


z 


z 


EST HUMAN 1 


EST HUMAN | 


z 


z 


z 


SWISSPROT 1 






1 




































5730066| 


§ 
1 




















54779 


Top Hit Ao 


i 
1 




AE00Q895.' 


i 

s 


AA286875. 


L04758.1 


AJ007763.1 






i 


BE408667/ 


L05468.1 


AF1 53362.- 


U68179.1 


Y10825.1 


J00440.1 


i 

1 




1 
< 


AF206551.' 






AE001073. 


AF283660.' 


AE000662. 




< 




milar 
Hit 
TE 


I 


E-02 


9 


9 


E-02 


E-02 


9 


9 




s 


s 


9 


9 




E-02 


9 


3 


9 


9 




3 




3 


9 


i 


s 


3 


9 


3 


Most Si 
(Top) 
BLAS 
ValL 




























































Expression 
Signal 






P 








f= 






























i 














! 


ORFSEQ 
ID NO: 


1 23578 






1 


249521 




287701 






20991 


1 


22876 




240S0I 


2571 8 1 


i 


§ 




1 


28441 1 


1 


287101 
















Isi' 


i 


4497 


i 




$ 


7985 


i 


i 




i 


2084 


3076 


3507 


4281 




5752 


s 


1 




18192 




1 


1 




R 


1 




5570 


i 






























































Probe 
SEQID 
NO: 


8 


1 


5034 


5255 


52551 




i 


1 


9484 


1232 


21971 


3151 




9 


i 


1 


i 


1 


1 


1 8315I 


1 8315! 


8574 


i 


8738 


i 


23471 


1 


S 


7003 1 
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i 

t 






E 
IS 


1 


hrough28, and complete cd 


Pseudomonas putida malonate decarboxylasB gene cluster (mdcA, mdcB, mdcC, mdcD, mdcE, m 
mdcH, mdcL and mdcM genes), complete cds 


i 






i 


i 




s s 


i 


1 






1 

E 










privation 






1 

1 


Mus musculus zinc transporter (ZnT-3) gene, complete cds 


i 

i 

! 

0 
5 
X 

s 
i 


Bos taurus connective tissue growtli factor precursor (CTGF) gene, cc 


RC2-PT0004-031299-011-d05PT0004 Homo sapiens cDNA 


Beet necrotic yellow vein virus RNA-2 


601115055F1 NIH MGC_16 Homo sapiens cDNA clone IMAGE:335! 




Mus musculus epidermal growth factor receptor (Egfr) gene, exons 5 f 

alternatively spliced 


Homo sapiens chromosome 21 segment HS21C002 


EST366723 MAGE resequences, MAGC Homo sapiens cDNA 


Molluscum oonfagiosum virus subtype 1, complete genome 


Human gene for dihydrolipoamide succinyltransferase, complete cds (1 




PM3-BT0347-1 70200-001 -bQ8 BT0347 Homo sapiens cDNA 


[Synechocystis sp. PCC6S03 complete genome, 17/27, 2137259-2267 


< 

■ Q 

'i 
1 

X 

iS 

1 

4 


Dictyoselium discoideum cyclic nucleotide phosphodiesterase gene, ci 


Thermcplasma acidophilum complete genome; segment 5/5 


EST378191 MAGE resequences, MAGI Homo sapiens cDNA 


Homo sapiens cAMP responsive element binding protein-te 2 (CREE 


M.musculus gene for gelatinase B 


complete cds, and small nuclear RNAs (uRNAs) 


Homo sapiens ABCA1 (ABCA1 ) gene, complete cds 


Homo sapiens ABCA1 (ABCA1 ) gene, complete cds 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen de 


H. sapiens AGT gene, intron 4 


H.sapiens AGT gene, intron 4 


111 




i 




i 




1 


















MAN ] 




MAN 1 






1 
















































































i' 






















































TopHitAcession 
No. 




BE897030.1 | 


< 


AW 8751 26.1 1 


X04197.1 1 


BE254318.1 | 


AE002246.2 | 


i 


AB017138.1 


AL1 63202.2 | 


AW954653.1 1 


U60315.1 1 


D26535.1 1 


D26535.1 


BE057219.1 ] 


D90915.1 1 


BF246744.1 | 


1 


AL445067.1 j 


AW966118.1 1 


S 

s 

!? 


X72794,1 1 


M28071.1 1 


i 

< 


i 

< 


ALII 4993.1 1 


X7420S.1 1 


X74208.1 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


8.2E-02j 


S 


8.2E-02| 


8.2E-02I 


8,2E-02| 




«N 


1 8.2E-O2I 




8.IE-O2I 


8.0E-C2| 


9 


i 


8.0E-02 


s 


8.0E-02j 


8.OE-O2I 


8.OE-O2I 


8.OE-O2I 


8.OE-O2I 


8.OE-O2I 


8.OE-O2I 


S 


8.0E-02I 


8.0E-021 


8.OE-O2I 


8.OE-O2I 


8.OE-O2I 


Expression 
Signal 




















I ^ 






10.54] 
































ORFSEQ 
ID NO: 


24S55i 


1 




S 




i 


253361 




21247 


i 


s 


§ 


21440] 




21632] 


221031 




s 


1 


1 








25581 


§ 






27472] 








6073 


i 






i 








1 


i 


i 


i 






2298 


i 


2797 


i 


i 


i 




i 


5506 


§ 


1 


1 




























































Probe 
SEQID 
NO: 


5014 


5271 


s 

5 


7094 


1 


7628 


i 


1 QToe! 


1479 






§ 


i 


1 


s 


2323] 


si 




2870) 


3745 


1 


i 


1 4834 


i 


1 




1 


74011 
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Top Hit Descriptor 


Homo sapiens SCG10 like-protein, helicase^lke protein NHL, M68, and ADP-ribos^ation factor related 
protein 1 (ARFRP1) genes, complete cds | 


|Drosophila orena hunchback region 1 




IMUE musculus Ranbp7 gene, Staf gene and Weel gene ( 


[600943191 F1 NIH_MGC_15 Homo sapiens cDNA done IMAGE:2959510 5' 1 


arS8c08 x1 Barstead colon HPLRB7 Homo sapiens cDNA clone IMAGE:2173646 3' similar to gb:Z26876 
60S RIBOSOMAL PROTEIN L3S (HUMAN); 




1602019770F1 NCI_CGAP_Brn67 Homo sapiens cDNA clone IMAGE:4155401 5' 1 


Arabidopsis thaliana K<M24L mRNA, partial cds ( 


Saccharomyces cerevisiae suppressor of MIF2 Smt4p (SMT4) gene, complete cds | 


ou63b05.s1 NCI_CGAP_Br2 Homo sapiens cDNA done IMAGE:1632465 3' similar to WP:C37A2.2 1 

CE08611 ; 1 


|ou63b05.s1 NCI CGAP Br2 Homo sapiens cDNA done IMAGE:1632465 3' similar to WP:C37A2.2 1 

CE08611 ; 


C 

i 
s 
g 
£ 
1 
1 

c 

j 
c 

i 
1 

i 
0 

z 

•< 




oo59d02.y5 NCLCGAP_Lu6 Homo sapiens cDNA done IMAGE:1570467 5' similar to contains L1 .t3 LI 1 


|PM3-FN0D58-1 40700-005-109 FN0058 Homo sapiens cDNA | 


^00943055F1 NIH_MGC_15 Homo sapiens cDNA done 1MAGE;2959593 5' 


tg48g12.x1 Soares_NFL_T_GBC S1 Homo sapiens cDNA done IMAGE:21 12070 3' similar to contains 1 
MER10.t3MER10 repetitive element; | 






0 
i 

1 

1 
Q 

5 
0 

i 

i 
1 


Homo sapiens WRN (WRN) gene, complete cds | 


jHomo sapiens partial AF^ gene, exons 2 to 7 and Alu repeat elements | 


2u53d11 .r1 Soares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:741717 5' similar to 1 
TR:G1173905G1173905SPLICEOSOMEASSOCIATED PROTEIN. ; | 


Top Hit 

Database 
Source 


1 """"^l 


iNT 1 


iNT 1 




[EST_HUMAN | 


EST HUMAN 


^ — 


[EST HUMAN ] 




iNT 1 


EST HUMAN | 


EST HUMAN 


EST HUMAN | 


z 


]EST HUMAN | 


i 

i 


EST HUMAN | 




NT ! 


]EST HUMAN | 


NT J 


NT J 


EST HUMAN | 


Top Hit Acession 
No. 


AF21 7796.1 


[AJ0D5375.1 j 


1 4503034] 


1AJ278435.1 




AI582029.1 


1 i 


IBF348454.1 J 


i 
1 


]U27832.1 ] 




1 

iAIOSI 644.1 


AI793275.1 


AI793275.1 


IBE836331.1 ] 


]BE250048.1 J 


AMI 8520.1 1 


AF233437.1 1 


AF233437.1 | 


IAA469354.1 ] 


1 


i 
2 


i 

§ 
1 


Most Similar 
(Top) Hit 
BUSTE 
Value 


8.OE-O2I 


8.OE-O2I 


8.OE-O2I 


8.OE.O2I 




7.9E-02 




7.9E-02| 


7.9E-02| 


7.9E-02| 


7.9E-O2I 


7.9E-02 


7.8E-O2I 


7.8E-02 


7.8E-02] 


7.8E-O2I 


7.8E-02I 


7.8E-02 


7.8E-O2I 


7.8E-02I 


7.7E-02| 


7.7E-02| 


7.7E-02! 


Signal 










n 












.J 


5.68 


,J 


,J 


0.81 1 




s 


.J 




1.271 








ORFSEQ 
ID NO: 


1 










1 22673 




i 




26756 


27833! 




20947I 


1 


1 243711 




248251 


27306 


27307! 


] 275131 






1 


Exon 
SEQ ID 
NO; 




s 


1 13887] 


i 




u 


? i 


1 




i 










14574 


i 


§ 


1 17113! 








13454 




Probe 
SEQ ID 
NO: 


1 8172 


1 9344I 


i 


i 


1 2127] 




3776 


1 


4706 


S3 

s 




7762 




1 1192 


s 


5019 


1 5197] 


1 




7389 


1 1378] 
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< 






ALPHA 1 












































5 










£ 


E 


















































1 










C6A9) 


f 
§ 






s 










































Dosdherin 4' 


1 
1 










e 


"i 






similar tog 










3385 3' 








in 














Top Hit Descriptor 


[PROBABLE SERINE/THREONINE-PROTEIN KINASE YBR059C 


[Homo sapiens KIAA0628 gene product (K1AA0628), mRNA 


|Homo sapiens interferon regulatory factor 7 (1RF7), mRNA 


|601316425F1 NIH MGC 8 Homo sapiens cDNA clone 1MAGE:3634903 5' 


|EST1 12214 Cerebellum II Homo sapiens cDNA5' end similar to similar to proti 


jHomo sapiens ASCL3 gene, CEGP1 gene, C11orf14 gene, C11orf15 gene, CI 


1RC3-CT0347-1 10300-01 4-a05 CT0347 Homo sapiens cDNA 


[Homo sapiens SCL gene locus 


[Campylobacter jejuni NCTC1 1168 complete genome; segment 5/6 


[QV3-BN004S-1 50400-1 51-e04 BN0046 Homo sapiens cDNA 






[Homo sapiens chromosome 21 segment HS21C078 




1 

g 
1 

1 

i 

E 

?| 

il 


[AU116913 HEMBA1 Homo sapiens cDNA clone HEMBA1 000264 5' 


[RC5-LT0054-260100-011-H09 LT0054 Homo sapiens cDNA 


[Equine herpesvirus 4 strain NS80567, complete genome 




1 
1 

i 

i 
1 

I 

0' 

§, 

1 


iHomo sapiens ADP/ATP earner protein (ANT-2) gene, complete cds 


Rattus norvegicus Activin receptor like kinase 1 (Acvrll), mRNA 




1 

i 

1 
1 

1 

i 

i 

1 




Human periodic tryptophan protein 2 (PWP2) gene, exons 15 to 21, and compl( 


IHomo sapiens histone deacetylase 5 (NY-CO-9), mRNA 


1 CM4-HT0243-081 1 99-037-d1 1 HT0243 Homo sapiens cDNA 


I601453813F1 NIH MGC 66 Homo sapiens cDNA clone IMAGE:3857738 5' 


1601658738R1 NIH_MGC_69 Homo sapiens cDNA clone IMAGE:388a209 3' 


op Hit 

itabase 








HUMAN 1 


HUMAN 




HUMAN 1 






HUMAN j 










HUMAN 


HUMAN 1 


HUMAN 1 






HUMAN 1 








HUMAN 1 


HUMAN 1 






HUMAN J 


HUMAN 1 


HUMAN 1 
















a 




























z 














OT 






TopHitAcession 
No. 












AJ400877.1 


IAW85S844.1 




i 




S 
i 




i 

5 


[AB01 5961.1 I 


AI864367.1 


[AU116913.1 I 


[AW838547.1 | 


IAF030027.1 1 


s 


< 


IL78810.1 1 


1 
§ 


1 




:BE880112.1 1 


i 
§ 


11525S93I 


'AV/379431.1 1 


i 

1 




Most Similar 
(Top) Hit 
BLAST E 
Value 


9 


9 




1 7.6E-02I 


1 7.6E-02I 


1 7.6E-O2I 


1 7.6E-O2I 


s 


1 7.6E-Q2| 


1 7.6E-O2I 


i 


9 


7.5E-02^ 


1 


9 


1 7.5E-02I 


1 7.4E-02I 


1 7.4E-02I 


1 7.4E-02I 


3 


1 7.4E-02I 


1 7.4E-02I 


1 7.4E-02I 


3 


1 7.4E-02| 


1 7.4E-02I 


1 7.4E-02I 


1 7.4E-02] 




I 




1 






t5 
cxi 


i 


i 
















i 








9 


i 


i 






















ORF SEQ 
IDNO: 




1 285041 




23067 












. 29028 1 


20535 


L 20536 


21 649 1 




En 




1 202311 






1 23255[ 


[ 24282! 


24370 [ 


1 




g 








[ 252421 




c e .. 
sag 


175101 


18253! 




13261 


13281 




1 




17545 




1 


1 






16734 


i 




11349 




1 


1 


14573 


14738 


15819 


16492 


1 


18975 




19235 


1 
































































Probe 
SEQID 
NO; 


1 


s 




1 3341 


1 3362 1 




4765 


i 


1 




e 


s 


1878 


1 


1 






1 14441 


1 25361 


1 35451 


1 


1 


i 


1 5913| 


i 


i 


i 


i 


1 
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Top Hit Descriptor 


601343926F1 NIH MGO 53 Homo sapiens cDNA clone 1MAGE:3685951 5' | 


601065194F1 N1H_MGC_10 Homo sapiens cDNA clone IMAGE:3451559 5' | 


Rattus norvegieus bHLH transcription factor Mist1 (Mist1) gene, complete cds | 




AJ230796 Homo sapiens library (Seranski P) Homo sapiens cDNA clone PS13D5 3' | 


O 
1 

1 

i 
1 

i 

i 

o 

1 

1 




CM4-NN1009-200300-116-C11 NN1009 Homo sapiens cDNA 1 


Ihuman immunodeficiency virus type 1 (D9) proviral structural capsid protein (gag) gene, partial cds | 




601872281F1 NIH MGC 53 Homo sapiens cDNA clone IMAGE:4092981 5' | 


601143974F1 NIH MGC 15 Homo sapiens cDNA clone IMAGE:3051 234 5' | 


COLLAGEN ALPHA 1(XVI) CHAIN PRECURSOR 1 


1 
S 
1 

s 


2J66f04.s1 Stratagene colon (#937204) Homo sapiens cDNA clone IMAGE:509599 3' 1 


UI-H-Bl1-acy-c-07-0-Ul.s1 NCLCGAP_Sub3 Homo sapiens cDNA clone IMAGE:2716020 3' 1 


ai65a12.s1 Soares testis NHT Homo sapiens cDNA clone 1375678 3' similar to gb:K03002 60S 1 
RIBOSOMAL PROTEIN L32 (HUMAN); | 


QV4-BT0407-2801 C0-090-e1 0 BT0407 Homo sapiens cDNA | 


CMO-UM0001-060300-270-e12 UM0001 Homo sapiens cDNA | 


Canis familiaris inducible nitric oxide synthase mRNA, complete cds | 


601 81 6291 F1 NIH_MGC_56 Homo sapiens cDNA clone IMAGE:4050071 5' 


African swine fever virus, complete genome ( 


1 
.i 
0 

1 

1 
1 


ah99a05.s1 Soares NFL T GBC SI Homo sapiens cDNA done IMAGE;1 3271 84 3' similar to gb:L1 4837 | 
TIGHT JUNCTION PROTEIN ZO-1 (HUMAN): | 


Homo sapiens chromosome 21 segment HS21C010 j 


Homo sapiens regulator of Gz-selective protein signaling (ZGAP1) mRNA, and translated products 


26S PROTEASOME REGULATORY SUBUNIT S3 (NUCLEAR ANTIGEN 21 D7) | 


26S PROTEASOME REGULATORY SUBUNIT S3 (NUCLEAR ANTIGEN 21 D7) | 


E 

1 
g. 
1 

•a 

1 
8 

1 
1 


1 Top Hit 1 


Database 
Source 


EST HUMAN j 


EST HUMAN | 




EST HUMAN | 


EST HUMAN | 


EST HUMAN | 


z 


EST HUMAN 1 






< 


EST HUMAN | 


& 

1 




EST HUMAN 1 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN | 


i, 


i 


EST HUMAN | 


i 




z 

i 




E ^ 


SWISSPROT 1 


1 
W 




1 

1 

i 

1 


Z 














U82828.1 1 


i 
< 


L02290.1 J 


AE004890,1 1 


BF2C8802.1 ] 


BE304764,1 | 


§ 
§ 




AA056343.1 1 


1 


AA815438.1 | 


BE070264.1 | 


1 

< 


I 

< 


BF3B1 987,1 J 


1 


1 


'aA724295,1 | 


AL163210.2 1 


1 

I 

i 


Q06354 1 


S 

§ 
a 


< 


BE565003.- 


BE539214.' 


1 

s 

< 




AJ230796.1 


1 


is 


BLAST E 
Value 


7.2E-02| 


1 


7.2E-02| 


7.2E-02I 


i 


7.2E.02I 


7.2E-C2| 




7.IE-O2I 


7.IE-C2I 


7.IE-O2I 


7.IE-O2I 


1 


7.0E-02I 


7.QE-O2I 


7.OE-O2I 


7.OE-O2I 


7.OE-O2I 




7.OE-O2I 


7.OE-O2I 


7.OE-02I 


7.0E-02I 


7.OE-O2I 


6.9E-02| 


6.9E-02 


6.9E-02I 


6.9E-02| 


6.9E-02I 


1 

S 


Signal 






















s 


8 












































c 

c 


ID NO: 


28073] 




28407| 












21 633 1 




22028 




202701 




1 




23523 1 


1 








27329 


• 275971 




202561 




23421 1 






Exon 
SEQ ID 
NO: 


78331 


7844 






§ 


g 






11758 


2127 








i 


1 


2922 


13734 


1 








7136 


7385 


18567 









3636 


1 




























































Probe 
SEQ ID 
NO: 


1 




82851 


9178 


921 4 1 


92731 


§ 




1 


22431 


2247 






1484 


1725 


i 


3.J 


1 


4047 


4121 


48461 


s 
s 


7534 


i 






37241 


37241 


5100| 
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Top Hit Descriptor 






IP.vulgaris mRNA for chalcone synthase | 


iqh41d01 .x1 Scares NFL T GBC SI Homo sapiens cDNA clone IUAGE:1847233 3' 1 


Homo sapiens chemokine receptor CXCR4 gene, promoter region and complete cds | 


Homo sapiens EWS, gar22, rrp22 and bam22 genes | 


lMR1-SN0064-010600-006-a12 SND064 Homo sapiens cDNA | 


iMus musculus DIPB gene (Dipb), mRNA | 


Rattus norvegicus cytochrome P4S0 2E1 (CYP2E1) gene, 5' flanking region | 


,501671045F1 NIH_MGC„20 Homo sapiens cDNA clone IMAGE:3954178 5' | 


|Homo sapiens E2F-like protein (LQC5127D), mRNA 1 


|Xenopus laevis alpha(E)-catenin mRNA, complete cds | 


1 Aquifex eeolicus section 96 of 1 09 of the complete genome | 


|zv46h12.s1 Scares ovary tumor NbHOT Homo sapiens cDNA clone 1MAGE:756743 3' similar to gb:M26038 1 
IhLA CLASS II HISTOCOMPATIBILITY ANTIGEN, DR-5 BETA CHAIN (HUMAN); | 


i 

i 
1 

i 
•1 

i" 

i 

£' 

s' 
1 

% 


8 
1 

§ 
8- 

E 
1 
1 


N ectria haematococca kinesin related protein 2 (KRP2) gene, complete cds | 


1 A.carterae precursor of peridinin-chlorophylla-protein (PCP) gene | 


1 Mus musculus histone deacetyiase 5 (Hdac5), mRNA | 


|Mus musculus histone deacetyiase 5 (Hdac5), mRNA | 


|qe07b01 x1 Soares_testis_NHT Homo sapiens cDNA clone IMAGErl 738249 3' similar to contains LTR8.b3 1 
LTRB repetitive element; | 


[Heterodera glycines beta-1 ,4-endoglucanase-1 precursor (HG-eng-1 ) gene, complete cds | 


1 Heterodera glycines beta-1 ,4-endoglucanase-1 precursor (HG-eng-1 ) gene, complete cds ( 


|501680425R2 NiH MGC 83 Homo sapiens cDNA clone 1MAGE:3950503 3' | 


1 Mus musculus chaperonin subunit 6a (zeta) (Cct6a), mRNA 1 


|k1419.seq.F Human fetal heart, Umbda ZAP Express Homo sapiens cDNA 5' | 


1 Homo sapiens mRNA for KIAA0554 protein, partial cds | 


(HLA-H) gene, RoRet gene, and sodium phosphate transporter (NPT3) gene, complete cds 


Top Hit 


ll 








lEST HUMAN | 






[EST HUMAN n 




6 


lEST HUMAN 1 




£ 




EST HUMAN 


lEST HUMAN 1 


z 








z 


EST HUMAN | 






1 




1 






Ton Hit Acession 




,AF204882.1 1 


i 


X06411.1 1 


IAI243326.1 | 


AF052572.1 | 








i 

< 


1 


1 


IU47624.1 1 


|AE000764.1 1 


i 


1 


IM21496.1 1 


lAFI 02993.1 1 


i 




i 


|a|191956.1 1 


IAF052733.1 1 


AF052733.1 


IBE974448.1 




1 


i 


U9132S.1 


IVIost Similar 
(laa) Hit 


BLAST E 
Value 


6.6E-Q2I 


6.6E-02I 


6.6E-02I 


6.6E-O2I 


6.6E-O2I 


6.6E-02I 


3 


6.6E-02I 




6.5E-Q2| 


6.5E-02I 


6.5E^2 


6.5E-02I 


6.5E-02I 


6.5E-02 


I 6.5E-02I 


6.5E-02 


i 


1 6.4E-02I 


1 6.4E-02I 


6.4E-O2I 


1 6.4E-02I 


6.4E-02 




1 


9 


1 6.4E-021 




F 




i 


S 




" 






§ 




















5> 






















ORFSEO 


ID NO: 




24817 


i 


1 


26706 


i 








203131 


20743 


211311 


214741 


25383 1 








20307 






1 


1 


25728 


26210 




i 


1 




Exon 1 


Q 

Si 


15033 




15873 


1 




1 


i 




i 


1 


1 


11275 


i 


1 


17927 




19051 


10501 




i 


1 


15625 


15625 




16732 


i 


17454, 


























































Probe 


NO: 


5167 


1 51901 


i 


1 


i 


1 




1 


i 






i 


1702 


1 54I3I 


1 


1 90341 




i 


1 48021 


5180 


S3 


5718 


5718 


s 


1 




7603 
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1 

1 


(HLA-H) gene, RoRet gene, and sodium phosphate transporter (NPT3) gene, complete cds 


Homo sapiens mucin 5B (MUC5B) gene, partial cds \ 




Mus musculus major histocompatibility locus class III regions HscTOf gene, partial cds; smRNP, G7A, NG23, 
Mue homolog, CLCP, NG24, NG25, and NG26 genes, complete cds; and unlfliown genes 


HEAT SHOCK PROTEIN 70 HOMOLOG | 


Hepatitis G virus RNA for polyprotein (NS5A region), partial cds. strain: CWR-1 52 | 


601873316F1 NIH_MGC_54 Homo sapiens cDNA done IMAGE:4097499 5' | 


TRANSCRIPTIONAL REGULATORY PROTEIN ALGP (ALGINATE REGULATORY PROTEIN ALGR3) | 


Arabidops s thaliana DNA chromosome 4. contig fragment No, 68 | 


complete cds | 


52 KD RO PROTEIN (SJOGREN SYNDROME TYPE A ANTIGEN (SS-A)) (RO(SS-A)) (R052) ( 


AV705701 ADB Homo sapiens cDNA clone ADBBAB03 5' | 


Mus musculus stromal cell derived factor receptor 2 (Sdfr2). mRNA | 




Aquifex aeolicus section 82 of 109 of the complete genome | 


7!37h08ji1 Scares NSF_F8 9W OT_PA_P_S1 Homo sapiens cDNA clone I^MGE:352381 S 3' similar to 1 
TR:Q9Y4S5 Q9Y4S6 HYPOTHETICAL 30.3 KD PROTEIN. [1] ; | 


Human mRNA, Xq terminal portion | 


Arabidopsis thaliana K+ inward rectifying channel protein (AtKCI ) gene, complete cds | 


Homo sapiens mRNA for KIAA1 464 protein, partial ods | 


H.sapiens mRNA for B-HLH DNA binding protein | 


IL3-HT061 8-1 1 0500-1 3S-G06 HTO61 8 Homo sapiens cDNA | 


1 
1 

E 
1 

•1 


Homo sapiens chromosome 21 segment HS21 C007 | 


Thermotoga maritima section 89 of 136 of the complete genome | 


EST380924 MAGE resequences, MAGJ Homo sapiens cDNA | 




zp78c04.r1 Stratagene HeLa cell s3 93721 6 Homo sapiens cDNA done IMAGE:62631 0 5' | 


Top Hit 
Database 
Source 










SWISSPROT 1 




EST HUMAN | 


SWISSPSOT 1 






SWISSPROT 1 


EST HUMAN 1 


2 






EST HUMAN 










EST HUMAN 1 








EST HUMAN | 




EST HUMAN | 


TopHitAcession 
No. 


i 


AF107390.1 1 




AF109905.1 


P37092 1 


AB010162.1 1 


BF21 0736.1 1 


P15276 1 


AL161 572.2 | 


AF271235.1 | 


Q62191 1 


AV705701.1 1 


1 
1 


1 


1 


|BF1 12039.1 1 


D16471.1 1 


U73325.1 J 


j 


i 
1 


m 




AL163207.2 | 


AE001777.1 J 


i 

i 
1 
< 


i 

1 


i 


Most Similar 
(Top) Hit 
BLAST E 
Value 


6.4E-02 


6.4E-02I 


6.4E-02I 


S 


S 


6.3E-02I 




6.3E-02I 


6.2E-02| 


6.2E-O2I 


i 


6.2E-02 


6.2E-02I 


6.2E-02I 


6.2E-02 


1 6.2E-02I 


S 


S 




S 


s 


s 


S 


6.OE-O2I 


6.0E-021 




6.0E-021 


















§ 


s 












? 














i 






g 






10 














ORF SEQ 
ID NO: 






252951 


21490 




278251 






s 










1 






200381 




Si 


268771 








8! 






1 


Exon 
SEQ ID 
NO: 




1 




i 


3467 


1 


i 


1 


1 




43731 


1 




i 




19142 


1 




s 




§ 


g 


1 


1 


2508 


1 


0076 
























































Probe 
SEQID 
NO: 


1 


1 


s 




35521 




1 


93/3 




1 


4479 




7459 






1 


8 


i 




§ 














2906 
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Top Hit Descriptor 


zp78c04.r1 Stratagene HeLa cell s3 937218 Homo sapiens cDNA clone IMAGE:626310 5' ( 


EST84266 Colon adenocarcinoma. IV Homo sapiens cDNA 5' end similar to tissue-specific protein | 




601658150R1 NIH MGC 68 Homo sapiens cDNA clone IN/!AGE:3876060 3' 


RC3-BT0253-01 1 199-013-b04 BT0263 Homo sapiens cDNA | 


Homo sapiens stimulated trans-acting factor (50 l<Da) (STAF50) mRNA | 




601 81 5274F2 NIH MGC 56 Homo sapiens cDNA clone IMAGE:4049226 5' | 


qf58b08j<1 Soares testis_NHT Homo sapiens cDNA clone IMAGE:1754199 3' | 


ts78a06.x1 NCI_ .CGAP_ GC6 Homo sapiens cDNA clone IMAGE:2237362 3' 


ts78a06.x1 NCI CGAP_GC6 Homo sapiens cDNA clone IMAGE:2237362 3' | 






Homo sapiens DNA-dependent protein kinase catalytic subunit-interacting protein 2 (KIP2), mRNA | 


■jvf69h03.x1 Soares NFL T GBC SI Homo sapiens cDNA clone IMAGE:2360885 3' similar to TR:O60298 | 
060298 KIAA0551 PROTEIN ; | 


RC1 -DT0001 -2901 00-01 2-e1 0 DT0001 Homo sapiens cDNA | 






Mus musculus Iroquois related homeobox 5 (Drosophila) (Irx5), mRNA | 


Vlus musculus foIIistatin-like (Fstl), mRNA . 1 


Homo sapiens ninein (LOC51199), mRNA | 


Gallus gallus HKC9 telomere junction I 




Populus trlchocarpa CCoAOMTI gene, exon 1 to exon 5 


Thermotoga maritima section 87 of 1 36 of the complete genome 1 


wx24c02.x1 NCl_CGAP_Kid1 1 Homo sapiens cDNA clone IMAGE:2544578 3' 


wx24c02.x1 NCI CGAP_Kid1 1 Homo sapiens cDNA clone IMAGE:2544578 3' | 


qh56f01 xl Soares Jetal_liver_spleen_1 NFLS_S1 Homo sapiens cDNA clone IMAGE:1 848697 3' similar to 1 
gb:M13142 COAGULATION FACTOR XI PRECURSOR (HUMAN); | 


qh56f01 xl Soares fetal liver spleen INFLS 81 Homo sapiens cDNA clone IMAGE'.1 848697 3' similar to ] 
gb:M13142 COAGULATION FACTOR XI PRECURSOR (HUMAN); 


Gallus gallus tyrosine kinase JAK1 (JAK1) mRNA, complete cds | 


)pHit 
abase 


z 

i 


1 


1 


1 


UMAN 






UMAN 1 


z 

i 


UMAN 1 


UMAN 1 








z 


UMAN 1 




















3 


i 

5 


UMAN 


UMAN 






EST H 






EST_H 


estIh 


NT 




EST H 


EST- 


ESTh 


EST h 


NT 


NT 


NT 


I 




NT 


NT 


NT 


NT 


NT 


NT 


NT 


NT 


NT 


ESTh 


EST 1- 


EST h 


EST h 


N]_ 


.i 












1 


1 
















































TopHltAces! 
No. 


AA188730.1 


i 


AA372375.1 


BE964443.2 


AW 37021 1.1 






BF382349.1 


1 

< 


AIB23167.1 




AJ245365.1 


AJ245365.1 


11431 


i 
< 


i 
< 


i 

< 


AF006304.1 


§ 


S 


i 


AJ240733.1 . 


D90110.1 


AJ223621.1 


AE001 775.1 


AW051 927.1 


AW051 927.1 


AI247505.1 


|aI247505.1 


< 


JgUJ 


E-02 


s 


E-02 




E^ 


s 


3 


E-02 


E-02 


s 


E-02 


E-02 


E-02 


E-02 


3 


E-02 


E-02 


E-02 


E-02 




S 


E-02 


E-02 


E-02 


3 




E-021 


E-02i 


3 


3 








































o> 


o> 




s 


















Expression 
Signal 


1,221 








3,46 


s 

oi 


cs" 




205 












» 


3.42 


g 


» 




2.72 




§, 


4.35 i 






4.36 






4.21 


2.04 


ORF SEQ 
ID NO: 


1 




229221 




8 


248481 


i 


252841 




274061 


27407 


27486 


2748/1 


25294 




20011 


22681 


1 


1 














23931 


239321 


1 24132 


S 






0076 




3116 






s 






6383 




1 






i 




i 




18 




s 


1 


i 


0841 




a 


i 


4156 


14341 


1 


































































Probe 
SEQ ID 
NO: 




3191 






ii 


6172 


6172 


1 




7340 


7340 


7411 


7411 


s 


9715 




P955 


4770 


7008 


8165 


£S 
S8 


1 




2830 


8 


4257 


4257 


J 


4447 


4471 
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Top Hit Descriptor 


Mus musculus tuftelin 1 (Tufti), mRNA | 




Homo sapiens elF4E-transporter (4E-T), mRNA | 






Oryaa sativa rbbi3-1 gene for putative Bowman Birk trypsin inhibitor | 


RC5-BT0559-1 40200-01 2-CD3 BT0559 Homo sapiens cDNA | 


Xenopus laevis homeobox protein (Vox-1 ) mRNA, complete cds | 


Mus musculus p-glycoprotein (mdrIa) gene, exons 1 and 2 


1 

o 
o 

1 

1 

1 
1 


Rana catesbiana heat shock protein 30 (HSP30) mRNA. complete cds | 


QV0-ST021 3-021 299-062-a09 STC213 Homo sapiens cDNA | 


QVO-ST0213-021299-062-a09 ST0213 Homo sapiens cDNA | 


ye37f12r1 Stratagene lung (#937210) Homo sapiens cDN A clone 1MAGE:1 19951 5' similar to gb:K01 506 | 
HU\ CI^SS il HISTOCOMPATIBILITY ANTIGEN, DP(1) ALPHA CHAIN (HUMAN); | 


Pseudomonas putida ttgS gene | 


1 

1 
E 

D 


Drosophila melanogaster laminin B2 gene, complete cds | 


Pseudomonas putida ttgS gene | 


i 

i 

i 

1 
1 


Mus musculus caudal type homeobox-1 (Cdx-1) gene, complete cds 1 


Helicobacter pylori 26595 section 5 of 1 34 of the complete genome | 


Helicobacter pylori 26595 section 5 of 1 34 of the complete genome | 


Lymphocystis disease virus 1 , complete genome I 


nuclear protein TIF1 isoform [mice. mRNA. 4053 nt] | 


Podospora anserina mitochondrial epsilon-sen DNA | 


Homo sapiens meprin A, alpha (PABA peptide hydrolase) (MEP1 A) mRNA | 


Homo sapiens partial LM01 gene for LIM domain only 1 protein, exon 1 | 


Homo sapiens partial LM01 gene for LIM domain only 1 protein, exon 1 | 


Human steroid hormone receptor Ner-I mRNA, complete cds ( 




Top Hit 1 


Source 


NT 1 




NT 


NT 


iNT 




EST HUMAN | 


NT 1 


NT 1 




NT 1 


i 


EST HUMAN | 


EST human! 


NT 


NT 1 


NT 


NT_ 1 


NT 


NT 1 










NT 


NT _ 




NT 




NT 


Too Hit Acession 




i 


109470341 


109470341 




U09771.1 


AJ277468.1 1 


BE073468.1 | 


U53528.1 1 


i 

i 


U20790.1 1 


U44894.1 1 


AW391 248.1 1 


AW391 248.1 1 


T94759.1 1 


AJ276408.1 


M58417.1 1 


M58417.1 


AJ276408.1 


AJ01 1048.1 


i 
i 


AE000527,1 1 


AE000527.1 1 


i 


S78221.1 J 


X03127.1 j 


50319081 


AJ277551.1 


AJ277661.1 1 


U07132.1 j 


L33246,1 1 




BUVSTE 
Value 


5.5E-02] 


5.5E-02I 


5.5E-02| 


5.5E-02I 


S 


S 


S 


S 


5.4E-02I 


5.4E-J 


5.4E-02I 


5.3E-02 


5.3E-02 


5.3E-02 


5.3E-02 


5.3E-02j 


5.3E-02 


5.3E-02 


5.3E-02 


5.3E-02I 


5.3E-02I 


i 


5.3E-02I 


1 


1 


5.2E-02 


5.2E-02 


5.2E-02 


5.2E-02 


5.2E-02 


i 


Signal 






















1.55| 


_ 1.62 


































3.351 


1.041 


c 
o 


d 


1 








§ 






24563 


247511 






207971 






^ 




1 






a 


24955 


i 


26145 








g 


1 


23854 


24311] 


Exon 
SEQID 
NO: 


6228 






7470 


s 


2914 


5078 


4787 




17974 




99601 


0955 


1 


2324 




2839 


3038 


i 


1 


5180 


i 






s 


2123 


3003 


3003 


4061 


4521 






























































Probe 
SEQ ID 
NO: 


i 


1 


7566 


e 


8382 


i 


3375 


s 


51081 


1 


8 


1037 


1037 


.J 


2447 


2912 


?912 


3113 


1 


5021 


5258 


5258 


9U9 


6353 


7276 


2239 


1 


3076 


4181 


46331 
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I! 
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Top Hit Descriptor 


|601177907F1 NIH MGC 17 Homo sapiens cDNA clone IMAGE:3533353 5' | 


lQV1-NN0012-180400-164-f06 NN0012 Homo sapiens cDNA 1 


[Homo sapiens KiAA0867 protein (KIAA0867), mRNA 1 


Fugu rubripes neural cell adhesion molecule Ll homolog (LI-CAM) gene, complete cds; putative protein 1 
complete cds; and calcium channel alpha-1 subunit> 


IBrassica napus gin gene for plastid glutamine synthetase, axons 1-12 1 


1 Homo sapiens mRNA for K1AA1 471 protein, partial cds | 


Homo sapiens cytochrome P450 polypeptide 43 (CYP3A43) gene, partial cds; cytochrome P450 polypeptide 
polypeptide 5 (CYP3A5) gene, partial cds 


IStrongylocentrotus purpuratus homolog of human bone morphogenettc protein 1 (submp) mRNA, complete 1 


GLUCOAMYLASE S1/S2 PRECURSOR (GLUCAN1,4-ALPHA-GLUCOSIDASE)(1,4-ALPHA-D-GLUCAN 
GLUCOHYDROLASE) 


c 

i 

i 

■ i 
1 
1 
1 

1 


1 Ovis aries mRNA for acetyl-coA carboxylase | 


tUI-H-BW1-anx-h-08-0-Ul.s1 NCI_CGAP_Sub7 Homo sapiens cDNA clone )MAGE:3084134 3' 1 


|FAS ANTIGEN LIGAND | 


IM.musoulus DNA for desmin-binding fragment DesD7 1 


Homo sapiens succinate dehydrogenase complex, subunit C, integral membrane protein, 15kD (SDHC) | 


1 Homo sapens hypothetical protein PR01 1 63 (PR01 1 63), mRNA | 


Homo sapiens hypothetical protein PR01 1 63 (PR01 1 63), mRNA 1 




jHuman germline T-cell receptor beta chain TCRBV17S1A1T, TCRBV2S1, TCRBV10S1P, TCRBV29S1P, 
TCRBV19S1P, TCRBV15S1, TCRBV11S1A1T, HVB relic, TCRBV28S1P, TCRBV34S1, TCRBV14S1, 
|TCRBV3S1, TCRBV4S1A1T, TRY4, TRY5, TRY6, TRY7, TRY8, TCRBD1, TCRBJ1S1, TCRBJ1S2,> 


Mus musculus chromosome X contigB; X-linked lymphocyte regulated 5 gene, Zinc finger protein 275, Zinc 1 
'finger protein 92, mmxq28orf 1 


501510891F1 NIH MGC 71 Homo sapiens cDNA clone 1MAGE:3912215 5' | 


Top Hit 
Database 
Source 




Iest human 1 


|NT 1 


NT 


iNT 1 


iNT 1 


NT 


NT 


SWISSPROT 


E z 


iNT 1 


[EST HUMAN 1 


1 SWISSPROT 1 


NT _ 


u 


H 


H 1 





z 




|EST HUMAN | 


Top Hit Acession 
No. 


IBE297236.1 | 


S 
1 




AF0261981 


|AJ271 909.1 1 


IAB040904.1 1 


AF280107.1 


i 


P08640 


AJ001 018.1 


IaJOOI 056.1 1 


IBF516149.1 1 




|AJ403386.1 1 


i 


i 
§ 


1 

CO ; 


i i 

ll 

a < 




1 
< 


|BE885137.1 | 


Most Similar 
(Top) Hit 
BIj^STE 
Value 


1 4.1E-02 


1 4.1E-02 


1 4.1E-02I 


4.1E-02 


1 4.1E-02 


1 4.0E-02i 


4.0E-02 


4.0E^)2 


4.0E-02 


4.0E-02 




1 3.9E-02I 


1 3.9E-02I 


1 3.9E-02| 




! 3.9E-02I 


o c 


5 9 

! ^ 


3.9E-02 


s 


3.8E-02] 


h 

S' 


1 0.861 


1 7.37| 






1 12.83| 


% 






2.8 










1 3.04| 






i ^ 


• 1 








ORFSEQ 
ID NO: 


1 236271 






i 


i 


22932 




26567 


271261 


? 


1 251351 






i 














21 683 1 


Exon 
SEQID 
NO: 


137361 




i 


16423 


i 


131301 


15225 




M 


?| 


i 


110201 


i 


11813| 


12536| 


14973| 


ii 


si 


19248 


i 


11804| 


Probe 
SEQ ID 
NO: 




1 43721 


i 


1 






5304 




S 1 


i S 
i s 




1 11041 


1322 


1 1918| 




5105 


1 51051 








i 



207 



/;/E:A^¥O01 57274 cpc; 

WO 01/57274 



PCT/USOl/00666 



^1 
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Top Hit Descripbr 


CYSTATHIONINE BETA-LYASE PRECURSOR (CBL) (BETA-CYSTATHIONASE) (CYSTEINE LYASE) 


EST370539 MAGE resequences. MAGE Homo sapiens cDNA 


IHYPOTHETICAL 80.7 KD PROTEIN IN S0D1-CPA2 INTERGENIC REGION | 


[Maizeaclin 1 gene(MAGl), oompletecds | 


|601644701R2 NIH UGC 56 Homo sapiens oDNA clone IMAGE;3929737 3' | 


iL.lactis MG1363 grpE and dnaK genes | 


[PM1-C 0326-291 299-002-h03 CT0326 Homo sapiens cDNA | 


PM1-CT0326-291299-002-h03 CT0326 Homo sapiens cDNA | 


1 60 1 1 78765F1 NIH MGC 20 Homo sapiens cDNA clone IMAGE;3543833 5' | 


[Homo sapiens mRNA for FLJ00013 protein, partial cds | 


f 
1 

3 

2 

E 

5 

! 

E 


Homo sapiens mRNA for FLJ00013 protein, partial cds | 


[Homo sapiens mRNA for FLJ00013 protein, partial cds | 


»/26d07.x1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone IMAGE:2814253 3' similar to 1 
SW:C21 1_HUMAN P53801 PUTATIVE SURFACE GLYCOPROTEIN C210RF1 PRECURSOR ; 




|yc20e06.r1 Stratagene lung («37210) Homo sapiens cDNA clone IMAGE:81250 5' similar to contains 1 
ImEP^9 repetitive element 


iHomo sapiens chromosome 21 segment HS21C008 1 


RC3-FN0155-0607aO-011-d10 FN0155 Homo sapiens cDNA 1 


RC6-UM0015-210200-021-A10 UM0015 Homo sapiens cDNA | 




\IA PROTEIN HOMOLOG (LA RIBONUCLEOPROTEIN) (LA AUTOANTIGEN HOMOLOG) \ 






iwl99d04.x1 NCLCGAP_Bm25 Homo sapiens cDNA clone IMAGE:2433031 3' | 


zq04f1 1 .si Stratagene muscle 937209 Homo sapiens cDNA clone 1MAGE:628749 3' similar to 
TR;G1017425G1017425 

ilPISGKPLPKVTLSRDGVPLKATMRFNTEITAENLTINLKESVTADAGRYElTAANSSGTTKAFINIWLDRPG 
iPPTGPWISDITEESVTLKWEPPKYDGGSQVTNYlLLKRETSTAVWTEVSATVARTMMKVMKL ... ; 


i 

i 

1 
< 

z 

.1 

1 

X 
X 

i 
i 


Cricetulus griseus CYP2A17 mRNA for cytochrome P450 2A17, complete cds 1 


jHomo sapiens skeletal muscle LIM-protein 1 (FHL1) gene, complete cds | 


Top Hit 
Database 
Source 


SWISSPROT 


1 
1 


ISWISSPROT 1 


z 


|EST HUMAN | 


z 


|EST HUMAN | 


lEST_HUMAN | 


|est_human 1 


z 


z 


z 


z 


EST HUMAN 


z 


EST HUMAN | 




lEST HUMAN 1 


|EST HUMAN | 




ISWISSPROT 1 


z 






EST HUMAN 


|EST HUMAN 1 






TopHitAcession 
No. 


P53780 


1 
< 


i 


IJ01233.1 1 


IBE958970.1 | 


1X76642.1 1 


|AW861 641.1 1 


|AW861641.1 1 


i 

i 


|AK024424.1 | 


IAK024424.1 | 


IAK024424.1 | 


|AKD24424.1 | 


AW274020.1 




i 




1 


IAW794952.1 | 


i 


IQ26457 J 


lAJOl 2469.1 1 


1 


S 


AA1 94306.1 


1 


i 
1 


|AF1 10763.1 1 


Most Similar 
(Top) Hit 
BU\STE 
Value 


3.5E-02 


i 


1 3.5E.02I 


1 3.5E.02I 


1 3.5E.02I 






1 3.5E-021 


1 3.5E.02| 


I 3.4E-02I 


1 3.4E-02I 


3.4E-021 


t 3.4E-02I 


3.4E-02 


3.4E-02I 


3.4E-02 


3.4E-02I 


3.4E-02| 


3.4E-02I 


1 3.4E-02I 


3.4E-02I 


3.4E-02| 


3.4E-02| 


3.4E-02I 


3.4E-02 


3.3E-02I 


3.3E-02I 


3.3E-02I 


§ 

II 

r 
































E. 


























ORF SEQ 
ID NO: 








1 257871 


1 27082| 




s 


1 




20310 




1 203101 


1 


20795I 




22120 1 






1 


241741 




1 










209071 


21 383 1 


Exon 
SEQID 
NO: 




14429 


i 






i 


i 


§ 


i 




i 


10504 


10504 


1 


1 


1 


1 








1 


i 


15105 


1 


1 


i 


11064] 




















































Probe 
SEQ ID 
NO: 


1 


1 45361 


1 5154| 


f2 
!» 


1 7012| 


7755 


i 




1 9749 1 


i 


1 






1035 


1188 




1 33841 


i 


1 


4494 


1 




1 60951 


i 


i 


§ 
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Top Hit Descriptor 


1 
1 
1 
t 
1 
g 

1 

i 
1 

! 


|yf25c09.r1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:127888-5' | 


Homo sapiens skeletal muscle LIM-protein 1 (FHL1 ) gene, complete cds | 


|Mus musculus tumor rejection antigen gp96 (Tral), mRNA | 


|501853910F1 NIH MGC 57 Homo sapiens cDNA clone IMAGE:4073787 5' | 


i 

1 

i, 
s 

i 
1 


|S022471 71 F1 NIH_MGC_62 Homo sapiens oDNA clone IMAGE:4332497 5' | 


ye4Sf11.r1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:121101 5' 


1 Human interleukin 11(111) gene, complete mRNA 1 


Orycfolagus cunlculus gene encoding ileai sodium-dependent bile acid transporter 1 


j Drasophila melanogaster heat shock protein 68 (hsp68) gene, hsp68d allele, complete cds | 




[Homo sapiens telomerase reverse transcriptase (TERT) gene, exons 7-1 6 and complete cds | 


tU\RGE TEGUMENT PROTEIN | 


[Oryctolagus cuniculus gene encoding ileal sodium-dependent bile acid transporter | 


1 

1 
< 
g 

i 

5. 

i 
I 

■J 

i 


Homo sapiens chromosome 21 segment HS21 C003 1 


1 H.sapiens RP3 gene (XLRP gene 3) 1 


Saxifraga nidifica maturase (matK) gene, chloroplast gene encoding chloroplast protein, partial cds 1 


S.griseocarneumwhiG-Stvgene | 


£: 

? 
E 


1 Rat/poIyomavirus left junction in cell line W98.14 | 


1 yd33h1 2.S1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone IMAGE:1 1 0087 3' similar to contains 1 
AIu repetitive element;contains LTR1 repetitive element ; ( 


Saguinus oedipus tissue kallikrein gene, complete cds | 


Mus musculus kinesin family member 3c (Kif3c), mRNA | 


|zg54b12.s1 Soaresjinealjland N3HPG Homo sapiens cDNA clone IMAGE:3971 51 3' similar to 1 
gb:L08441 CYTOCHROME C OXIDASE POLYPEPTIDE III (HUMAN); | 




Homo sapiens dual specificity phosphatase 4 (DUSP4).mRNA | 


1 NEURONAL ACETYLCHOLINE RECEPTOR PROTEIN, ALPHA-3 CHAIN PRECURSOR (GF-ALPHA-3) | 


1 Mus musculus adaptor-related protein complex AP-3, delta subunIt (Ap3d), mRNA | 


1 Drosophila melanogaster mRNA for headcase protein | 


Top Hit 
Database 
Source 


z 


IEST HUMAN 1 


z 


z 


|EST HUMAN 1 


z 

i 


[EST HUMAN | 


lEST HUMAN 1 




z 


Z 






ISWISSPROT 1 




|EST HUMAN | 


z 


z 


2 








z 

I 






EST HUMAN | 




z 


0 

1 
% 






Top HitAcession 




|AE000700.1 1 


IR09112.1 1 


< 




j 


IBF245995.1 | 


1BF691 107.1 1 


T96545.1 


|M81890.1 1 


IAJ002005.1 1 


S 
£ 


i 




IP28955 1 


|AJ002005.1 1 


1BE857353.1 | 


IAL163203.2 | 


1 


AF1 14182.1 




1 


IM32437.1 1 


i 


< 




1 


IAB021 684.1 1 


s 


1 


1 




Most Similar 
(Top) Hit 


BLAST E 
Value 


1 3.3E-02I 


1 3.3E-02I 


1 


^3.3E-02| 


9 


1 3.3E-02| 


1 3.3E-02I 


3.3E^2 


3.3E^2 


3.2E-02| 








1 3.2E-02I 


1 3.2E-02I 


3.2E-02I 


1 3.2E-02I 


1 3.2E-02I 


3.2E-02 


1 3.2E-02| 


3.2E-02I 


3.2E-02I 


3.2E-02 


3 

m 


3.2E-02I 


3.2E-02I 


t 3.2E-02I 


3.1E-02I 


3 


3 


3 


i 
1 


Signal 












































^ 


















§. 










































ORF SEQ 
ID NO: 








1 


1 259011 


1 25902] 


i 






1 


i 


1 


1 21 503 1 




1 


22817 


233541 




. 24333 


1 25365] 




259591 




L 26013J 










§ 


1 216211 




SEQ ID 
NO: 


1 


11929 


i 


14262 


i 


i 




i 






11025 






1 










s 




15318 


15836 




1 


i 


1 


1 


1 






































































Probe 
SEQID 
NO: 


1705 


1 20381 


9 


1 


5875 


5875 


i 


i 


1 94411 




1 11101 


1 IIIOl 


1 17341 


8 


1 


1 


i 




i 


1 


8 


5931 


i 


i 


1 


7786 


s 


i 


1283 


1850 


1 
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Top Hit Descriptor 


[Chicken dorsalin-1 mRNA, complete cds ( 


1 

•s 

1 
t 
8 

£ 

! 

i 

! 

i 
§ 

1 


Ixa52b04.x1 NCI CGAP Sar4 Homo sapiens cDNA clone IMAGE:2570383 3' similar to SW :Y059 HUMAN 1 
Q15041 HYPOTHETICAL PROTEIN KIAA0D69 ; 


|qg27f11.x1 NCI CGAP Kid3 Homo sapiens cDNA clone IMAGE:1762317 3' | 


[601 493473T1 NIH_MGC_70 Homo sapiens cDNA clone IMAGE:3895578 3' 1 


1 Rattus norvegicus Nerve growth factor receptor, fast (Ngfr), mRNA | 


|Homo sapiens KIAA1070 protein (KIAA1070), mRNA | 




zs84c0Zr1 NCI CGAP GCB1 Homo sapiens cDNA clone IMAGE:704152 5' | 


lb 
1 

i 
1 

Q 

i 
1 
1 

i 
■f 
1 


1602015501 F1 NCI_CGAP_Brn64 Homo sapiens cDNA clone IMAGE:4150944 5' 1 


on26f06.y5 NGLCGAP_Lu5 Homo sapiens cDNA clone IMAGE:1557827 5' | 


|on26f06,y5 NCLCGAP„Lu5 Homo sapiens cDNA clone IMAGE:1557827 5' | 


1 

1 
1 

i 

1 

i 


|601680305R2 NIH_MGC_B3 Homo sapiens cDNA clone IMAGE:3950666 3' | 






IH.carterae mRNA for fucoxanthin chlorophyll a/c binding protein, Fcpl | 


!PM2-NN0128-080700-001-a12 NN0128 Homosapiens cDNA 


|PM2-NN0128-080700-001-a12NN0128 Homo sapiens cDNA | 


i 

.i 
§ 

ir 

i 

1 


1 7e30e09.x1 NCI_CGAP_Lu24 Homo sapiens cDNA clone IMAGE:3284008 3' similar to contains LI .t1 LI 1 
repetitive element ; | 


I601579393F1 NIH_MGC_9 Homo sapiens cDNA clone IMAGE:3928054 5' 1 


i 

s 

z 

1 

1 

i 

i 


i 

i 

1 

1 
1 

i 

i 
1 


IHYPOTHETICAL 45.7 KD PROTEIN C19G10.05 IN CHROMOSOME 1 | 


IHYPOTHETICAL 45.7 KD PROTEIN C19G10.05 IN CHROMOSOME 1 | 


i 

of 

1 

1 

1 
J 


1 butyrophilin-like (NG9), butyrophllin-li> 


Homo sapiens gene for LECT2, complete cds , ( 


III 








1 












S 


1 


i 


1 


i 


1 


















i 


1 


0 




















Z3 




























I 


X 












1 
























z 










h' 


























h 




i 








Top HitAcession 
No, 


|L1 2032.1 1 


AE002014.1 1 


AW241 154.1 1 


AI206030.1 1 


BE621 748.1 | 


i 


114320201 


AL163303.2 J 


AA279351.1 | 


AW500547.1 1 


BF343827.1 1 


AI793130.1 1 


AI793130.1 1 


BE974314.1 | 


BE974314.1 | 


5 


1 


1 


BE701165,1 1 


BE701 165.1 1 


AW592114.1 1 


BE670128.1 1 


i 


BF526722.1 1 


1 


Q10335 1 


Q10335 ] 


AJ237936.1 ] 


AF050157,1 


AB007545.1 ] 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 2.6E-02I 


1 2.6E-021 


1 2.6E-O2I 


S 


1 2.6E-O2I 


1 . 2,6E-02| 


S 


|. 2,6E-02| 


I 2,6E-02| 




1 2.6E-02I 


1 2.5E.02I 


1 2,5E-02| 


1 2,5E-02| 


S 


3 


^ 


1 25E-02I 


I 2.5E-02I 


2,5E-02| 


9 


2,5E-02| 


i 


1 2.5E-02I 


S 


2.5E-02I 


9 


2.5E-021 


2.5E-02 


9 
1^ 

oi 


Expression 
Signal 








































0 






















ORF SEQ 
ID NO: 


1 24481 1 


s 


24667 




25895 1 










1 


1 




202751 


20562] 


1 










236441 


237881 






1 


i 


28320] 


28321 ] 


28370] 






cS .. 


1 




1 




15776 


15977 


17306 


17798 


18579 


1 






1 




10778 


1 


s 


g 


S 






1 




16378 


e 




s 






1 
































































Proba 
SEQID 
NO: 


9 


i 


1 5025 




1 


1 61301 


1 7388| 


1 


1 


1 












1 2735| 


i 


1 2926] 


S 






1 5759I 




1 




1 8185| 


1 8185| 


1 


8255 


1 
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Top Hit Descriptor 


1 MR4-TN01 1 5-0809D0-201 -b1 2 TN01 1 5 Homo sapiens cDNA | 


Izq40g10.r1 Stratagene hNT neuron (S937233) Homo sapiens cDNA clone 1MAGE:632226 5' ( 




IHYPOTHETICAL CALCIUM-BINDING PROTEIN C18B11.04 IN CHROMOSOME 1 | 


1 Cyanophora paradoxa cyandle, complete genome | 


1 Homo sapiens KIAA1 009 protein (KIAA1 009), mRNA 1 


iHomo sapiens chromosome 21 segment HS21C103 ) 


1 Homo sapiens valyl^tRNA synthetase 2 (VARS2), mRNA | 


|602019135F1 NCLCGAP^Bm67 Homo sapiens cDNA clone IMAGE:41 54504 5' 1 


yh54b10.r1 Soares placenta Nb2HP Homo sapiens cDNA clone IMAGE:133531 5' 


|yh54b10.r1 Soares placenta Nb2HP Homo sapiens cDNA clone IMAGE:133531 5' | 


1 Plasmcdium falciparum (strain FCR3) variant-specific surface protein (var-2, var-3) genes, complete cds's | 


1 RC4-CN0049-1 401 00-01 1 -Cl 1 CN0049 Homo sapiens cDNA | 


O 

3 
1 

i 

1 

< 




[Homo sapiens NESH protein (L0C51 225), mRNA 


|HaemophiIus influenzae Rd section 115 of 163 of the complete genome | 






1 AV723785 HTB Homo sapiens cDNA clone HTBAHH1 1 5' | 




Ixb09d09.x1 NCLCGAP_GU1 Homo sapiens cDNA clone IMAGE:2575793 3' 1 


lArabidopsis thaliana DNA chromosome 4, contig fragment No. 82 . 1 


lArabidopsis thaliana DNA chromosome 4, contig fragment No. 82 | 


] Homo sapiens coagulation factor XII (Hageman factor) (F1 2), mRNA | 


|Mus musculus histccompatibility 2, complement component factor B (H2-Bf), mRNA | 


IEST374761 MAGE resequences, MAGG Homo sapiens cDNA 1 


1 EST374TS1 MAGE resequences. MAGG Homo sapiens cDN A | 


|601567403F1 NIH_MGC_21 Homo sapiens cDNA clone IMAGE:3842280 5' 1 


I601567403F1 NIH_MGC_21 Homosapiens cDNA clone IMAGE:3842280 5' | 


Top Hit 

Database 
Source 


|EST HUMAN 






ISWISSPROT 










EST HUMAN 


EST HUMAN 


1 




|EST HUMAN 


|EST HUMAN j 












|EST HUMAN | 




EST HUMAN 1 






z 


z 






EST HUMAN | 


,EST HUMAN j 


TopHitAcession 
No. 


|BF0929421 | 


1 

i 


|AF260225.1 | 


|Q09711 1 


1 114672821 




|AL163303.2 1 


1 11417739] 


BF345554,1 1 


R32667.1 


|R32667.1 1 


i 
S 


|AW750834.1 | 


i 




1 77059801 


1 


IU67779.1 J 


|AF21 6854.1 1 


IAV723785.1 | 


§ 
< 


|AW074212.1 J 


1 
< 


|AL1 31 586.2 | 






1 


i 
s 


|BE733142.1 1 


|BE7331421 | 


Most Similar 
(Top) Hit 
BLAST E 
Value 




.5E-02 


.5E-02! 


S 


.5E-02, 


.5E-02 


.5E-02 


.5E-02: 


S i 




.5E-02 


S 


.5E-02 


.5E-02 


s 


.4E-02 


S 


.4E-02I 


.4E-02, 


S 




.4E-02 


.4E-02 


.4E-02I 


.4E-02 


.4E-02 


.4E-02 


.4E-02 


s 


.4E-02 






























































Signal 


















^ E 


























1 


















ORF SEQ 
ID NO: 


1 23365 


1 23727 


1 24048 






26399 


i 






5 






















1 


230741 




23162] 


23197] 


23308) 


_ 2406^ 


s 




24446] 


Isi 


13577 


13951 


1 






16239 








I 1 






g 




8 


















13356 


13392 


13620 


14283 


14283 


i 


1 
































































<» g 


1 


1 


1 


1 


i 


s 


i 


1 


Si 




1 


1 


1 


§ 


















1 


1 


s 


i 


1 


I 






|Si 
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Top Hit Descriptor 


602018037F1 NCLCGAP_Bm67 Homo sapiens cDNA clone IMAGE:4153808 5' 


za40e05.r1 Scares fetal liver spleen 1 NFLS Homo sapiens cDNA clone 1MAGE:295040 5' | 


Homo sapiens MHC class 1 region 1 


fq95b10.x1 NCI CGAP_Ov23 Homo sapiens cDNA clone IMAGE:2216539 3' similarto SW:XPF_HUMAN 
Q92889 DNA-REPAIR PROTEIN COMPLEMENTING XP-F CELL ; 


=M3-HT0175-300999-001-h0S HT0175 Homo sapiens cDNA 1 


1 
C 

< ^ 

u 

e i 
<f> I 

^ \ 
9 ' 

Si 


QV2-STD2g6-1 50200-O28-c1 1 ST0295 Homo sapiens cDNA 


1 

5 


0 
0 

3 

g 
Q 

8 

^ 




EST1 85494 Colon carcinoma (HOC) cell line II Homo sapiens cDNA 5' end 1 


Homo sapiens T-box 5 (TBX5), mRNA 


ab77f1 1 s1 Stratagene fetal retina 937202 Homo sapiens cDNA clone IMAGE:853005 3' similar to contains 1 
Alu repetitive element; 


MR3-CT0176-111099-003-e10 CT01 7B Homo sapiens cUNA 1 


CM2-HT0177-041099-017-ti12HT0177HomosapienscDNA 1 


oo22h08.s1 NCI CGAP GOBI Homo sapiens cDNA clone IMAGE:1350495 3' | 


301 649957R1 NIH MGC 74 Homo sapiens cDNA clone IMAGE:3933689 3' | 


MRO-CT0050-081099-003-h10 CT0060 Homo sapiens cDNA 1 


i! 

II 
il 

1 ] 

s i 
s % 
£ 

II 
§ I 

11 


yq54h01 .r1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:199633 5' 


Homo sapiens attractln precursor (ATRN) gene, exon 25 and complete cds. alternatively spliced 


D 

u 

II 

i! 
il 
i| 

U 

5 ! 


MR4-BT035S-070100-201-h01 BT0355 Homo sapiens cDNA I 


1 

I 

3 

& 
i 


501459570F1 NIH_MGC_66 Homo sapiens cDNA clone IMAGE:38631 77 5' 


501459570F1 NIH MGC 66 Homo sapiens cDNA clone IMAGE:3863177 5' | 


Top Hit 
Database 
Source 


EST HUMAN | 


EST HUMAN 1 




EST HUMAN \ 


EST HUMAN | 


EST HUMAN j 


EST HUMAN 


EST HUMAN | 


SWISSPROT 1 


EST HUMAN | 


EST HUMAN 1 


z 


EST HUMAN 


EST HUMAN | 


EST HUMAN J 


il 

z> z 

I 3 


EST HUMAN 


EST HUMAN 1 


EST_HUMAN j 


EST HUMAN | 


z 


EST_HUMAN 


Si 


z 


S 


EST HUMAN , 


TopHitAcession 
No. 


BF345263.1 1 


1 


AF055066.1 | 


AI653508.1 1 


BE1 44537.1 1 


AW81 3796.1 j 


AW820281.1 


BE149611.1 1 


Q61982 1 


AA082578.1 | 




114355051 


1 


AW846120.1 1 


AW368128.1 1 


8 


I i 

\ S 


S 
1 


s 

1 


1 


i 


BE876539.1 


=1 


Z29642.1 


BF035331.1 1 


BF035331.1 


Most Similar 
(Top) Hit 
BLASTS 
Value 


1.1E-02I 


1.1E-02I 




I.IE-02I 


1.1E-02 


I.IE-02I 


1.1E-02 


I.IE-02I 


1.1E-02 


I.IE-02I 


1.1E-02 


1.1E-02 


1.1E-02 


1.0E-02 


1.0E-02 


1.0E-02| 




1.0E-02 


S 


1.0E-O2 




1.0E-02 


1.0E-02 


S 




1.0E-02 


Expression 
Signal 




s 














E5 


^ 


















i 
















ORF SEQ 
ID NO: 






22660 


23200 












27785 


27900 


2847S 




1 


21270 




22950 




23505 


24407 


1 










27476 


Exon 
SEQ ID 
NO: 


1 






3394 


? 




4704 




7037 


7560 


1 


8224 


1 








11 


3380 




ii 


4870l 


5OI61 






























































Probe 
SEQID 
NO: 


i 


2848 




3478 


- 3P37 








1 


7710 


7810 


8347 


i 




1506 


2525 






8 


ii 


i 




5751 


i 


7403 


1 7403 
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Top Hit Descriptor 


Rattus norvegicus neuronal nicotinic acetylcholine receptor subunit (AiphalO) mRNA, complete cds | 


|xe34f09.Xl NCi_CGAP_Ut1 Homo sapiens cDNA clone IMAGE:2509033 3' similar to TR:Q1 2987 Q1 2987 1 
ACIDIC 82 KDA PROTEIN.; | 


O 
1 

1 

i 

a 

1 
i 
I 

o 

H 




Iho39h08.x1 Soares NFL T GBC SI Homo sapiens cDNA clone IMAGE:3039807 3' similar to TR:093434 ( 
093434 RETICULOCALBIN. ; • | 


hD39h08.x1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone IMAGE:3039807 3' similar to TR:093434 1 
093434 RETICULOCALBIN. ; | 


[RC1-CT0286-050400-O18-C08 CT0286 Homo sapiens cDNA | 


jzd33f1 0.r1 SoaresJetaLheart_NbHH19W Homo sapiens cDNA clone 1MAGE:342475 5' | 


IEST30674 Colon 1 Homo sapiens cDNA 5' end | 


7g34b10j<1 NCI CGAP Bm23 Homo sapiens cDInJA clone IMAGE:3308347 3' similar to TR:Q13387 1 
Q13387 HYPOTHETICAL PROTEIN 384D8„2. ;contains TAR1.t2TAR1 TAR1 repetitive element ; | 


< 

1 

1 

i 

i 

o 


S cerevlsiae chromosome 1 1 reading frame ORF YBL077W 1 


S.cerevisiae chromosome II reading frame ORF YBL077W | 


IRC5-HT0582-160300-011-D02 HT0582 Homo sapiens cDNA | 


|BETA-GALACTOSIDASE PRECURSOR (LACTASE) | 


|BETA-GALACT0S1DASE PRECURSOR (LACTASE) | 


IAV687379 GKC Homo sapiens cDNA clone GKCAFC07 5' | 


|Bos taurus mRNA for NDP52, complete cds | 


yvl 5h01 .s1 Soares fetal Hver spleen 1 NFLS Homo sapiens cDNA clone IMAGE:242833 3' similar to contans 
Alu repetitive element; 


601145154F2 N1H_MGC_19 Homo sapiens cDNA clone IMAGE:3160476 5' 1 


hd22a05.x1 Soares NFL T GBC S1 Homo sapiens cDNA clone IMAGE:2910224 3' similar to 
SW:PXR_HUMAN 075469 ORPHAN NUCLEAR RECEPTOR PXR ; 


|hd22a05.)Cl Soares NFL T GBC SI Homo sapiens cDNA clone IMAGE:2910224 3' similar to 1 
|SW:PXR HUMAN 075469 ORPHAN NUCLEAR RECEPTOR PXR ; | 


jDanio rerio odorant receptor gene cluster | 


|ah78e1 1 .si Soares testis NHT Homo sapiens cDNA clone 1321772 3' | 




Top Hit 
Database 
Source 




EST HUMAN | 


|EST HUMAN | 




i 


EST HUMAN | 


I EST HUMAN 1 


jEST HUMAN | 


jEST HUMAN | 


EST HUMAN | 


lEST HUMAN 1 






|EST_HUMAN | 


ISWiSSPROT 1 


ISWISSPROT 1 


lEST HUMAN | 


z 


EST HUMAN 


EST HUMAN 1 


EST HUMAN 


i 




|EST HUMAN | 


[EST HUMAN | 


TopHilAcession 
No. 


< 


|aW117711.1 


§ 


i 

s 


BE044191.1 


BE044191.1 


i 
< 


i 


IAA327129.1 


BE857385.1 


|BE928133.1 






|BE175667.1 






IAV687379.1 | 


IAB008852.1 | 


H94065.1 


BE263253.1 


AW511148.1 


AW51 1148.1 


1 
< 


|AA759135.1 


IAA759135.1 | 


Most Similar 
(Top) Hit 
BU^^STE 
Value 


1 7.0E-03 




1 7.0E-03I 


8 


7,0E-03 


1 7.0E-03 


1 7.0E-03| 


1 7.0E-03I 


1 7.0E-03I 


7.0E-03 


1 7.0E-03I 


S 


s 


1 7.0E-03I 


s 


9 


7.0E-03I 


7.0E-03I 


7.0E-03 


7.0E-03 


u S 


6.0E-03 


s 


1 6.0E-03I 


I 6.0E-03I 


g 




^ 




























S 














p 






ORF SEQ 
ID NO: 


1 














1 258311 


1 












1 27670] 


1 




1 










i 




1 22582] 


Exon 
SEQ ID 
NO: 


1 






14795 






1 








i 




1 






s 


17756 




19750 










12790 


1 


















































Probe 
SEQID 
NO: 


1 3741 


1 4434 


1 


i 


1 50381 


J 


1 57171 


5813 


i 






1 64411 


i 




1 


1 




i 


1 


11 


1220 


1 


1 2744] 


1 


i 



227 



wo 01/57274 PCT/USOl/00666 





Top Hit Descriptor 


yr77h04.r1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:211351 5' | 




genes, complete cds 


genes, complete cds 


ZDl3a11.r1 Soares_parathyroid_tumor NbHPA Homo sapiens cDNA clone IMAGE:322172 5' | 


UI-H-BI4-apm-c-06-0-Ul.s1 NCI_CGAP_Sub8 Homo sapiens cDNA clone IMAGE:3a87754 3' | 


Mus musculus glucosamine-6-phosphate deaminase (Gnpi), mRNA | 


RCO CT0204 240999-021-blO CT0204 Homo sapiens cDNA | 


600942904F1 NIH^MGC_15 Homo sapiens cDNA done IMAGE:2959513 5' 1 


yv62h10.s1 Soares_multipIe_scIerosis_2NbHMSP Homo sapiens cDNA done IMAGE;278179 3' 1 


1 

1 
< 

i 
■| 

ii 
1 

1 


EST271 1 6 Cerebellum II Homo sapiens cDNA 5' end similar to EST containing Alu repeat 1 




i 
1 

1 
1 

Q 

i" 
1 
1 

1 

1 

t 
I 

1 

1 
1 


Homo sapiens telomerase reverse transcriptase (TERT) gene, exons 7-1 6 and complete cds | 


contains MERIO.bl MER10 repetitive element ; | 


< 

i 

1 
1 

i 
8 


601 45491 5F1 NIH_IVGG_66 Homo sapiens cDNA done IMAGE;3858626 5' | 


Subacute sclerosing panencephalitis (SSPE) virus mRNA for fusion protein | 


ti22c02.x1 NCI CGAP Kidll Homo sapiens cDNA clone IMAGE:2131202 3' similar to SW:R13A HUMAN 1 
P40429 60S RIBOSOMAL PROTEIN L13A ; 


M.thermoformicicum complete plasmid pFVI DNA | 


EST374237 MAGE resequences, MAGG Homo sapiens cDNA | 


Homo sapiens hypothetical zinc finger protein FLJ14011 (FU14011), mRNA | 


te91c12jt1 NCI CGAP Pr28 Homo sapiens cDNA clone IMAGE:2094070 3' similar to TR:O00519 000519 1 
FATTY ACID AMIDE HYDROLASE. ; | 


te91c1Zx1NCI CGAP Pr28 Homo sapiens cDNA clone IMAGE:2094070 3' similar to TR:O00519 000519 1 
FATTY ACID AMIDE HYDROLASE. ; | 


Mus musculus zinc-finger protein mRNA, complete cds 1 


601572746F1 NIH MGC 57 Homo sapiens cDNA clone IMAGE:3839747 5' | 


Top Hit 


Database 
Source 


EST HUMAN | 




NT 




EST HUMAN | 


EST HUMAN | 


NT 1 


EST HUMAN | 


EST HUMAN 1 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN | 




EST HUMAN 1 


NT 1 


EST HUMAN | 


EST HUMAN | 


EST HUMAN | 


NT 1 


EST HUMAN | 


NT 1 


EST HUMAN | 


NT 1 


EST HUMAN | 


|EST HUMAN | 


NT 


EST HUMAN | 


: 


i 

li 

c 

% 


i 


AF190338.1 


i 
§ 




W37985.1 1 


i 
§ 


67540291 


AW847284.1 | 


BE250108.1 1 


1 


AI016833.1 1 


AA324242.1 ] 


AL163281.2 | 


1 
i 


AF128894.1 | 


AI033980.1 1 


i 
1 


i 


D10548.1 1 


AI432661.1 1 




t 

i 


115458141 


i 

< 


AI420786.1 1 


i 
5 


BE737895.1 | 


Most Similar 1 


BLAST E 
Value 


% 


1 6.OE-O3I 


6.0E-03 


6.OE-O3I 


6.0E-03I 


6.0E-03I 


e.0E-03| 


6,0E-03| 


6.0E-03| 


6.0E-03| 


6.0E-03I 


6,0E-03| 


6.0E-03| 


6.0E-03I 


6.0E-03I 


6.OE-O3I 


6.0E-03I 


s 


6.0E-03I 


6.0E-03 


8 


6.0E-03I 


6.0E-03I 


6.0E-03I 


6.OE-O3I 


6.0E-03| 


6.0E-03I 


Expression 
Signal 








1.24 




3.621 




0.931 




0.87| 


1.371 


6.82| 




1 




7.O4I 












2.23| 


2.21 1 


» 








ORFSEQ 
ID NO: 






23062 


§ 




i 


23407 










242711 


E 


247671 


26542| 


26657] 






8 




280391 


I 






1 28357 




S 


Exon 
SEQ ID 
NO: 






13256 


g 


13416| 












1 


1 


1 


1 


1 






1 




i 






8 






i 


1 


1 Probe 1 


SEQ ID 
NO: 


1 


i 


333a 


3J 


1 


1 


37111 




38731 


4231 1 


4257 


4597| 


5115 




1 






1 




7776 


1 


8127 


1 


1 




i 


8351 1 
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Top Hit Descriptor 


j Rhodobacter capsulatus strain 3B1 003, partial genome | 


genome | 


[Anguilla japonica mRNA for activin B, complete cds | 




601 482621 Fl NIH_MGC_68 Homo sapiens cDNA clone IMAGE:3885388 5' 




complete ORFA, and grpE- 1 


1 
f 
< 

0 

1 
a. 
1 


complete ORFA, and grpE- | 


complete ORFA, and grpE- | 


1 Arabidopsis thaliana mRNA for DEAD box RNA helicase,RH3 | 


i 

1 
< 

i 

1 

I 

1 

1 
1 
1 


1 Homo sapiens mRNA for KIAA1 180 protein, partial cds | 


1601 1 94796F1 NIH_MGC_7 Homo sapiens cDNA clone IMAGE:3538799 5' 


I>^81f09,s1 Soares infant brain 1N1B Homo sapiens cDNA done IMAGE:22395 3' | 




|M86g02.s1 Soares breast 2NbHBst Homo sapiens cDNA clone 1MAGE:1 55566 3' | 






Citrus sinensis seed storage protein citrin mRNA, complete cds | 


IEST12218 Uterus tumor 1 Homo sapiens cDNA 5- end | 


yu79g10.r1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone IMAGE:240066 5' | 




Homo sapiens SCL gene locus | 




! 

i 

z 

i 
1 

i 
§ 
i 

5 
\ 

: 

iS 

1 

1 


iSODlUM CHANNEL PROTEIN PARA (PARALYTIC PROTEIN) | 


; aminoacyl-tRNA synthase, complete cds; 


"8 
1 

1 

i 

! 

1 




; aminoacyl-tRNA synthase, complete cds; 


la trachomatis partial ORFB; 
in, complete cds 


a trachomatis partial ORFB; 
in, complete cds 


a trachomatis partial ORFB; 
in, complete cds 


0 
1 

ll 

1 8 


Chlamydi 
lil^e prote 


Chlamydi 
Iil<e prote 


Chlamydi 
|lil<e prote 


1% 
i ^ 

gl 


1 Top Hit 


Database 
Source 










IeST HUMAN 1 














|EST HUMAN 1 




lEST HUMAN 1 


|EST HUMAN ] 




|EST HUMAN | 








|EST HUMAN | 


|EST HUMAN | 






i 


ISWISSPROT 1 




z 


i 


|aE000833.1 


|AB025356.1 | 


|u30790.1 


|BE788019.1 1 


IAJ245480.1 1 


yj5105.1 




L25105.1' 


L25105.1 


IAJ010457.1 1 


1 
< 


IAB033006.1 1 


|BE266057.1 1 




IAL161491.2 1 


|R71 794.1 1 


IAJ297357.1 | 


IAF147449.2 | 


i 




IH78355.1 1 


1 


IAJ131016.1 1 




< 


|P35500 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


6.0E-03I 


e.oE-osj 


6.0E-03| 


6.OE-O3I 


6.0E-03I 


6.0E-03I 


5.0E-03^ 


9 
»• 


5.OE-O3I 


8 


5.0E-03I 


8 

s 


5.0E-03I 


8 


8 

S 


5.0E-03I 


8 


8 


5.0E-03I 


5.0E-03I 


8 


5.0E-03I 


8 


5.0E-03| 


5.0E-03I 


5.0E-03I 


1 


■5 

(f 














o> 






^ 






? 




q 




§ 


q 


















c 


6 

Q 














20406 


20407I 


204061 


20407I 


208541 
















23341 1 








i 


241811 


1 


1 


1 Exon 1 


SEQ ID 
NO: 


1 189221 


s 






1 192621 


1 192741 




1 


1 


1 




1 




128321 


t 13024| 


i 




8 


1 


1 136121 




1 14106| 


i 




1 


1 


1 Probe 


SEQID 
NO: 


1 9185| 




9372 


,J 


9722 


1 






i 


i 


§ 


1 15521 


8 


a 


1 


1 3116| 


3129 


3240 


i 




1 


1 42071 


i 
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Top Hit Descriptor 


a a D. " 
m OQ o ■ i 
3 D ir " -5, 

^li if 

III II 

sis If 

^is- til 
ill III 

2 ^ ^ u' 1 E 
0 E «) i2- 5 2 1 

liiii 


BETA-GAUCTOSIDASE PRECURSOR (LACTASE) ' 

6g4F Heart Homo sapiens cDNA clone 694 " ' ■ 

xn59g05.Xl Soares_NHCeC_cervicaLtumor Homo sapiens cDNA clone IMAGE:2698040 3' similar to 

contains LI .t2L1 repetitive element : 


xn59g05.x1 Soares^NHCeC_cervicaLtumor Homo sapiens cDNA clone IMAGE:2698040 3' similar to 

contains LI .t2 LI repetitive element ; 

yb09eO4.r1 Stratagene placenta (SS37225) Homo sapiens cDNA clone IMAGE:70686 5' ■ 

Mus musculus hypothetical protein. MNCb-4760 (LQC58212), mRNA ■■ 

fz46c04.y1NCI CGAP Brn52 Homo sapiens cDNA clone IMAGF 99Q1R,, 


1 J . 

M ^ 0 CO S 

i ipili tiliip 
1 ii£ii|i|fitl|f i 

iJiigiilili iisi 
Itpili iiiiii ?i 

If 


Top Hit 
Database 
Source 


51 

is 

g (fl H H 


SWISSPROT 
NT 

EST HUMAN 
EST HUMAN 


ii i 


iiiiiiiiiiiii 


1 










Top Hit Ace 
No. 


s 1 1 i 

0 ffl < < 


P48982 
M61 132.1 
T1 9586.1 

AW1703341 


AW1703341 
T49153.1 

1094« 

BE048055.1 


AF067253.1 
LI 0347.1 

AA456597.1 

BF572332.1 

AW449109.1 

Q02388 

AI668709.1 

AW500196.1 

R46482.1 

^A939339.1 

^48482.1 

W749101.1 

\A099777.1 

\W794740.1 


:2 - Ui 

,i ^ 1- " 


.OE-03 
.OE-03 
.OE-03 
.OE-03 


.Ob-03 
■OE-03 

OE-03, 


OE-03 

OE-03 
OE-03 














Expression 
Signai 


2.43 
6.17 

6.02 
6.2s 


6.92 
7.44 

3.17 


3.17 
1.95 

— y 


1.55 

1.28 
4.16 
1.94 

1.41 
2.97 
2.18 
0.94 
3 

1.81 
3.43 
32.55 
1.58 


ORF SEQ 
ID NO: 


25695 

24877 
26605 


I i 


ill 


25197 

20015 
20097 
20203 

20637 

20889 
20910 


Exon 
NO: 


15594 
15881 
15101 
16424 




18275 
18307 


19091 
19153 

19174 
19529 
19301 
19601 
19400 
10201 
10280 
10380 
10526 
10785 ' 
10819 
11047 
11065 


Probe 
SEQ ID 
NO: 


5685 

5091 
6556 


7005 
8103 


S 3 S S g 


9460 
9555 

9616 
9791 
9808 
9929 
232 
318 
435 
588 
859 

1133 
1153 



230 




231 
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Top Hit Descriptor 


1 Homo sapiens K1AA0345 gene product (KIAA0345), mRNA | 


te49b1 1 .X1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA done INMGE:2090013 3' similar to contains Alu 1 
repetitive element; | 


Homo sapiens chromosome 21 segment HS21 C009 | 




1 Homo sapiens chromosome 21 segment HS21 COOS | 


|PM4-BN0138~13050C-OC2-b08 BN0138 Homo sapiens cDNA | 


I601118164F1 NIH MGC 17 Homo sapiens cDNA clone 1MAGE:3028095 5' | 


7q74c09.x1 NCI_CGAP_Lu24 Homo sapiens cDNA clone I^MGE; 3' similar to contains Alu repetitive 1 
element;nontalns element MER31 repetitive element ; | 


hh02c07.x1 NCi_CGAP_Kid1 1 Homo sapiens cDNA clone IMAGE:2953932 3' similar to contains element 1 

LTR6 repetitive element. 


iRC3-ST0281-240400-015-fD3 ST0281 Homo sapiens cDNA | 


Homo sapiens Grb2-associated binder 2 (K1AA0571 ), mRNA 1 


Homo sapiens protein kinase CK2 catalytic subunit alplna gene, exon 1 | 


Homo sapiens protein kinase CK2 catalytic subunit alptia gene, exon 1 | 


1 nc73c05.El NCI CGAP Pr2 Homo sapiens cDNA clone IMAGE:7B2984 similar to contains Alu repetitive 1 


Homo sapiens MHC class 1 region | 




|Mus musculus intestinal trefoil factor gene, partial cds | 


Mus musculus intestinal trefoil factor gene, partial cds | 


'Arabidopsis thaliana rpoMt gene | 


IS01237982F1 NIH MGC_44 Homo sapiens cDNA clone IMAGE:3609933 5' | 


1 iL2-UM0076-240300-056-D03 UM0076 Homo sapiens cDNA . 1 


iMus musculus alpha-l(XVIll) collagen (COL18A1) gene, exon 1 and 2 | 


i 
1 


'AV762392 MDS Homo sapiens cDNA clone MDSBSGOI 5' | 


1 AV752392 MDS Homo sapiens cDNA clone MDSBSGOI 5' | 


|ah04f09.y5 Gessler Wilms tumor Homo sapiens cDNA clone 1MAGE:1 1 55589 5' | 




1 

i 

1 


'xuS.P10.H3 oonorm Homo sapiens cDNA 3' 


1 DKFZp761 B071 2 r1 761 (synonym: hamv2) Homo sapiens cDNA clone DKFZp761 B071 2 5' | 


Top Hit 
Database 
Source 


iNT 1 




iNT 1 


(NT 1 


z 


|EST_HUMAN | 


tESTHU^MN 1 


EST HUMAN j 


EST_HUMAN J 


lESTJHUMAN | 


iNT 1 


iNT 1 


INT 1 


|EST HUMAN | 


INT 1 


INT 1 


INT 1 


iNT 1 


iNT 1 


'EST HUMAN | 






z 


EST HUMAN | 


z 

% 


lEST HUMAN | 


z 


NT 


z 
1 
I 


'EST HUMAN | 


J 

Jl 


g 


1 


i 
< 


|AL1 63278.2 | 


|AL1 63206.2 | 


IBE815173.1 1 


i 


|bF224125.1 | 


i 
1 


IAW819141.1 1 




|AF01 1920.1 1 


i 

< 


i 


< 


^32521,1 J 


5 


1 
=) 


i 


i 

1 


AW802687.1 


=) 


|Y1 2500.1 1 


1AV752392.1 | 


IAV762392.1 | 




|Z32521.1 1 


IAJOI 1432.1 1 


IA1536141.1 1 


|AL1 19057.1 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


4.0E-03j 


I 


i 4.0E-03I 


4.0E-03I 


4.0E-03I 


4.0E-03I 


4.0E-03I 


4.OE-O3I 


4.0E-03I 


4.0E-03| 


4.0E-03I 


3.0E-D3I 


3 


3.0E-03^ 


3.0E-a3| 


S 


3.0E-03I 


3.0E-03I 


s 


9 
ri 


3.0E-03| 


s 


3.0E-03I 


3.0E-03I 


3.0E-03I 


3.0E-03I 


3.0E-03| 


I 


3.0E-03| 


3.0E-03| 


§ o 


1 2.02 1 










1 3.951 


1 2.251 




















1 I.I2I 




i 


L 4.53 








1 7,43| 


1 7.43 1 












ORFSEQ 
ID NO: 


1 268411 


B 




1 27229 1 














8 




1 206381 


1 




















1 23591 1 




1 23632] 




s 

I 




1 


Exon 
SEQ ID 
NO: 


1 






17035 


1 


19725 


i 


19202 




1 


i 


1 


10787 






12132 


12133 




§ 


12971 


i 


i 




13805 


13805 


1385S 


1 


14194 




14518 




























































SEQID 
NO: 


1 


1 70561 


7151 


i 


1 


1 


9317 


i 


1 


1 


i 


367 




1 


2211 


2248 


1 


1 




3044 


3112 


3371 




1 




s 
g 


4058 


1 


4417 


1 
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Single Exon Probes Expressed in Heart 


Top Hit Descriptor 

Kattus norvegicus mRNA for conne3dn36 (cx36 aena) 

RING CANAL PKOIbIN (KELCH PROTEIN) ' ■ ■ 

RING CANAL PROTEIN (KELCH PROTEIN) ■ 


O 

i 

i 1 
1 . ^ 

1 1 1 ^ 

g " Q " i= 

^ 1 S z 

%llU 

S S o c S 

UJll 

.111,1 <2- 

III pi 

^ X c I a. CO 


1 1 

1 1 

1 1 ^ 

I f 1 
k k <■ 

^ U a '1 

ram o c 

I I Ii 

■a f o i g 

1 1 

f 1 Ul 

r5 c<j Q- S S, 

^ ^ 1 2| s 
i s > 1 

™ < rag K- is B 

lilSgll 


? t 
1 1 

N ? 5- 1 § 

iii§ ^ III fir 
•.III!, nm m 

illli ill 

\WA iff 

Hill iiilll iil 

iiiliiiliiiii 


Top Hit 
Database 
Source 

NT 

SWISSPROT 
SWISSPROT 


EST_HUMAN 

EST HUMAN 
NT 


NT 
NT 

SWISSPROT 
NT 


NT 

EST HUMAN 
EST HUMAN 
EST HUMAN 

NT 

SWISSPROT 
NT 

MT 

VT 

EST HUMAN 

sIT 

MT 

gT 


TopHitAcession 
No. 

Q04652 


T70874.1 
M20783.1 
AAe61605.1 
AF284446.1 

P48509 


4557836 
4557836 

P29400 

AA450138.1 

AF302691.1 


AL163302.2 
AW137782.1 
AA450138.1 
BF568955.1 

X87344.1 

AB040802.1 

P03374 

U68491.1 

L42512,1 

-42512.1 

\F223391.1 

=187773.1 

\J245167.1 

\Y005150.1 
\Y005150.1 


Most Similar 
(Top) Hit 
BLAST E 
Value 

3.0E-0a 
3.0E-03 
2.0E-03 
2.0E-03 


2.0E-03 
2.0E-03 
2.0E-03 
2.0E-03 

2.0E-03 


2.0E-03 

2.0E-03 
2.0E-03 
2.0E-03 
2.0E-03 


2.0E-O3 
2.0E^3 
2.0E-03 
2.0E-03 

2.0E-03 
2.0E-03 
2.0E-03 
2.0E-03 
2.0E-03 
2.0E-03 

2.0E-03 
2.0E-03 
2.0E-03 
2.0E-03 
2.0E-03 / 




Expression 
Signal 

1.35 
2.33 
0.89 
0.89 


5 g S ^ 




1.04 
4.62 
4.03 
1.02 

4.87 
0.92 
2.15 
9.77 
1.93 
1.93 

0.94 

0.82 

1.04 




ORFSEQ 
ID NO: 

25296 
20258 
20259 


21104 
21106 
21115 

21243. 


21263 

21264 

21502 
21731 


23088 
23093 

23315 
23572 
23703 

24130 
24131 

24285 

24820 
24821 




Exon 
SEQID 
NO: 

19668 
1901S 

10446 




11404 

11404 
11477 
11634 
11845 


12409 
13289 
13294 

13529 
13780 
13927 
14027 
14340 

14495 
14500 
14947 
15057 
15057 




Probe 
SEQID 
NO: 

9159 
9339 
504 
504 




1500 

1500 
1573 
1733 
1950 


2535 
3370 
3376 

3615 
3869 
4024 
4127 
4445 
4446 

4S08 
4612 
5077 
5194 
5194 



1 
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2 
I 


60187S385F1 NIH_MGC_55 Homo sapiens cDNA clone IMAGE:4104692 5' 


Homo sapiens mRNA for KIAA0693 protein, partial cds | 


Xenopus laevis xefiltin mRNA, complete cds | 


iATP-DEPENDENT NUCLEASE SUBUNIT B | 


lATP-DEPENDENT NUCLEASE SUBUNIT B | 


1 CARBONIC ANHYDRASE-RELATED PROTEIN 2 PRECURSOR (CARP 2) (CA-RP 11) (CA-XI) | 


CARBONIC ANHYDRASE-RELATED PROTEIN 2 PRECURSOR (CARP 2) (CA-RP II) (CA-XI) | 


6018S7434F1 NIH MGC 17 Homo sapiens cDNA clone IMAGE:4121408 5' [ 


ADAM-TS 7 PRECURSOR (A DISINTEGRIN AND METALLOPROTEINASE WITH THROMBOSPONDIN 
1 MOTIFS 7) (ADAMTS-7) (ADAM-TS7) 


1 L.esculentum mRNA for lysyl-tRNA synthetase (LysRS) | 




1 CM4-BT0366-061299-054-d01 BT0366 Homo sapiens cDNA [ 


hf37b06.x1 Soares NFL T GBC SI Homo sapiens cDNA clone 1MAGE:2934035 3' similar to TR:Q60976 1 
Q60976 JERKY. ; 


yx42g06.s1 Soares melanocyte 2NbHM Homo sapiens cDNA clone iMAGE:264442 3' similar to contains 1 
[ L1 .b2 L1 repetitive element ; | 


L1.b2L1 repetitive element; | 


TENASCIN PRECURSOR (TN) (HEXABRACHION) (CYTOTACTIN) (NEURONECTIN) (GMEM) (Jl) 
(MIOTENDINOUS ANTIGEN) (GLIOMA-ASSOCIATED-EXTRACELLULAR MATRIX ANTIGEN) (GP 1 50- 
j225) (TENASCIN-C) (TN-C) 


i 

1 

i 

1 
< 
§ 

i 
§ 

1 
0 

0 
Z 

i 


Human dystrophin gene [ 


PROTEOGLYCAN LINK PROTEIN PRECURSOR (CARTILAGE LINK PROTEIN) (LP) [ 


|RC3-BT0333-310800-115-g04 BT0333 Homo sapiens cDNA | 


(H.sapiens variable number tandem repeat (VNTR) locus DNA j 


ty65h03.x1 NCI CGAP Kidll Homo sapiens cDNA clone IMAGE:2283989 3' similar to SW:VATG MANSE 1 
Q25532 VACUOLAR ATP SYNTHASE SUBUNIT G ; ' | 


Homo sapiens SEL1 L (SEL1 L) gene, partial cds | 




0 

i 
0 

Z 

-i 
1 

E 
0 

1 
1 


Database 
Source 


z 

i 

B 


1 INI 


|NT 1 


[SWISSPROT 1 


ISWISSPROT 1 


[SWISSPROT 1 


[SWISSPROT 1 


[EST HUMAN [ 


SWISSPROT 


1 INl 


EST HUMAN 


[est HUMAN 


Iest human 1 


|est human 1 


|est human 1 


SWISSPROT 


[EST HUMAN | 


[NT 1 


[SWISSPROT [ 


[EST HUMAN [ 


iNT [ 


EST HUMAN | 


iNT [ 


[NT [ 


1 


TopHitAoession 


z 


|BF241410.1 1 


i 


|U63711.1 1 


IP23477 1 


IP23477 1 


IQ95203 1 


[Q95203 1 


[BF308187.1 [ 


Q9UKP4 




§ 
1 




AW592004.1 1 


|n20287.1 


|n20287.1 1 


P24821 


[AA251 376.1 | 


[M86524.1 1 


IP07354 [ 


i 


IZ11740.1 [ 


AI625745.1 | 




|AJ245167.1 [ 


1 

1 


Most Similar 
(Top) Hit 


BLAST E 
Value 


1 2.0E-03| 


1 2.0E-03I 


1 2.0E-03| 


S 


1 2.0E-O3I 


1 2.0E-03I 


[ 2.0E-03[ 


[ 2.0E-03[ 


2.0E-03 




2.0E-O3 


\ 2.0E-03I 


2.OE-O3I 


1 2.0E-O3 


1 2.OE-O3I 


3 


[ 2.0E-03I 


[ 2.0E-03[ 


1 2.0E-03[ 


S 


1 2.0E-03[ 


1 2.OE-O3I 


1 2.0E-03[ 


3 


[ 2.0E-03I 


§ 
1 




1 ■"■37| 








1 3,25| 




1 1.82[ 










i 










[ 5.41 1 


1 2.78[ 






1 12.17[ 


2.07 






1 2.76| 


ORFSEQ 


ID NO: 


1 251 28| 


1 254381 






1 25725| 




[ 25847[ 


1 25849] 










1 26763] 


1 




1 






§ 








1 29115] 






Exon 
SEQ ID 
NO: 


1 


1 


1 


i 


1 156241 


1 157351 


[ 15735| 


1 157371 


15756 






i 


16571 1 


i 


1 




17625| 


s 


[ 16350[ 


i 




1 


1 


s 




Probe 
SEQID 
NO: 


1 5372 




1 


5716 


L 57161 




i 


5831 


5850 


5861 


s 


1 6212| 


1 


1 67841 




7513 


1 7775 1 


8 


8779 


1 


i 


9052 


s 


9113 
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Top Hit Descriptor 


1 

1 

1 
! 

z 

1 

1 

% 


j 

5 
1 

i 
i 

z 

1 
§ 

1 


IPM0-HT0339-200400-01 0-D02 HT0339 Homo sapiens cDNA | 


1 APOLIPOPROTEIN A-IV PRECURSOR (APO-AIV) | 


IAV685870 GKC Homo sapiens oDNA done GKCDME11 5' | 


tzs44roi Jl NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:700345 5' | 


1 Homo sapiens KVLQT1 gene | 


Epstein-Barr virus (AG876 isolate) U2-IR2 domain encoding nudear protein EBNA2, complete cds | 


1 Epstein-Barr virus (AG876 isolate) U2-IR2 domain enccxiing nudear protein EBNA2, complete cds | 


ICOLLAGEN ALPHA 1(V1I) CHAIN PRECURSOR (LONG-CHAIN COLLAGEN) (LC COLLAGEN) | 


8 


Homo sapiens T-ceii iymptioma invasion and metastasis 1 (TIAM1), mRNA | 


yd93a11.r1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE:115772 5' | 


|QV3-NN1024-2B0400-171-g05 NN1024 Homo sapiens oDNA | 


[Human gene for fourtti somatostatin receptor subtype | 


1 Homo sapiens 959 l<b contig between AML1 and CBR1 on ctiromosome 21q22, segment 2/3 | 


CDM protein (CDM), adrendeukodystrophy protein > 


IHuman TRPM-2 protein gene, exons 1 ,2 and 3 | 


I 

5 

1 

1 
1 
i" 
1 




|V.carteri gene encoding volvoxopsin | 








1 
1 

1 

1 
I 

A 

f. 

c 
1 

1 


|RC1-CT0279-181099-011-a09 CT0279 Homo sapiens cDNA | 


|RC1-CT0279-181099-011-a09 CT0279 Homo sapiens cDNA | 


|Q V3-HT0543-220300-1 30-a03 HT0543 Homo sapiens cDNA | 


|tt73e12.x1 NCI CGAP_HSC3 Homo sapiens cDNA clone IMAGE:2246446 3' similar to TR:Q26195 Q26195 1 
|PVA1 GENE. ; | 


1 AV759949 MDS Homo sapiens cDNA done MDSDDF1 1 5' ( 


|601433087F1 NIH_MGC_72 Homo sapiens cDNA clone IMAGE:3918524 5' | 


Top Hit 
Database 
Source 


lEST HUMAN 1 


lEST HUMAN 1 


lEST HUMAN 1 


ISWISSPROT 1 




|EST HUMAN | 


z 


\- 
z 


z 


ISWISSPROT 1 


z 


Z 


IEST HUMAN 1 


lEST HUMAN | 












z 


z 










1 


|EST HUMAN | 


|EST HUMAN 1 


|eST HUMAN 1 


EST HUMAN 


|EST HUMAN 1 


TopHitAcession 
Na 


IAI073485.1 1 


IAI073485.1 . 1 


IBEI 54067.1 1 




i 


IAA290951.1 1 


|AJ006345,1 1 


IK03332.1 1 


IK03332.1 1 




i 


1 


IT87761.1 1 


IAW902585.1 j 


D1 5825.1 


|AJ229042.1 I 


U52111.2 


IM63376.1 


|AJ251973.1 


AF1 53980.1 


? 


M30471.1 


i 


|AF01 1400.1 


lAFO1 1400.1 


IAW362393.1 


IAW362393.1 


i 


|aI583847.1 


i 

1 




IWIost Similar 
(Top) Hit 
BLAST E 
Value 


.OE-03 


.OE-03 


.OE-03 1 


.OE-03 


.OE-03 


EO-30- 


.OE-03 


.OE-03 


EO-30- 


.OE-03 


EO-30- 


.OE-03 1 


.OE-03 


.OE-03 


.OE-03 


s 


.OE-03 


.OE-03 


.OE-03! 


.OE-03 


.OE-03 


.OE-03 


.OE-03 


.OE-03 


£0-30- 


.OE-03 


.OE-03 


.0E-03i 


.OE-03 


.OE-03 


.OE-03 
































































§ 

11 

r 






?3 

ID 


1 




















































































ORF SEQ 
ID NO: 








24649 




1 






1 


1 253971 






25842 




1 








1 267071 








27346 


27602 




E 


28197 








1 29113| 


Exon 
SEQ ID 
NO; 


1 


1 


i 




1 


15173 


1 


1 


1 




15577 


1 


1 


1 




i 


S 


S 


16516 


1 


17052 


g 


s 






17947 


17947 


§ 


i 


18370 


i 
































































Probe 
SEQ ID 
NO: 


5 


4717 




§ 

s 






1 53171 


1 


i 


1 54231 


1 


s 


1 58241 


1 


1 62421 


1 641 9| 




i 


i 


i 


1 71751 


7273 


7273 


1 




1 


i 


8133 


1 


1 


1 
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Table 4 

Single Exon Probes Expressed in Heart 




Top Hit Descriptor 


.a 

1 
8 

i i 

" lO « < 
g S o ^ 

li i 1 

!i ii ! i 
II l! 1 1 

1 9 E id 

1! tigl 

1 fill 
iilllillli. 


1 

1 ii . . ii ill 

is II ^ i 1 JI m 

«^ lis 1 iipf^gl 

ii |t| iil!il|i^i<l!i 
Ii 11 ifi^llllli^i- 1 

111 !i-ni|i!itili 

i5iliiiJ-i||i||i,iillifii 
ikpil i'llli 8,ff i 
iilililiiiiiilililsiiili 


Top Hit 


Source 


EST HUMAN 
EST HUMAN 
NT 

SWISSPROT 

NT 

NT 

EST_HUMAN 1 
SWISSPROT 


ii 1 iii i iii 


1 

i 
1 




AI347355.1 
BE780572.1 
L11910.1 
P06727 
AB037203.1 
X96469.1 
R07008.1 
P08547 
I291SS 1 


AA777084.1 
AI571099.1 
L41 825.1 
U29185.1 
AL1 63210.2 

AI769331.1 

U78027.1 
Z40561.1 
BE077941.1 
R1 7336.1 

6005855 
AI862525.1 
K01315.1 
<01315.1 
J45983.1 

'464D8 < 

\L048507.2 

3E005850.1 

U229042.1 

W01 3847.1 E 

301758 £ 




MostSimilai 
(Top) Hit 


BIJ^STE 
Value 

i 1.0E-OS 


OOOOcpSSSS 


i.OE-04 
.OE-04 
.OE-04 
.OE-04 

.OE-04 
.OE-04 

OE-04 
OE-04 
OE-04 
OE-04 
OE-04 
OE-04 
OE-04 
OE-04 

DE-04 

)E-04 
)E-04 5 
)E-04/ 








! Expression 




2.15 
4.2 
0.81 

1.42 
1.39 
1.27 
4.2 
2.39 


2.08 
2.16 
1.61 
1.13 

1.03 
2.24 

3.2 

3,29 
2.72 
3.28 
1.34 
0.85 
0.85 
3.26 
4.35 
3.13 
2.24 
2.07 
6.32 
2.45 




ORFSEQ 
ID NO: 

2528/ 


ii i i 


21562 
22127 
22437 
22S65 

28973 

23576 

23681 
23769 

27887 
28881 
28950 




Exon 
SEQ ID 


NO: 


19687 

14998 
15404 
17412 
11377 
13754 
13989 
14542| 


18483 

12230 
12547 
13165 
15828 

18683 
19155 
19300 
19321 
13788 
13905 
13905 
13992 
16470 
17583 

18664 
187141 






9626 
5131 
5485 
7561 
1472 
3843 
4089 
4656 
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Top Hit Descriptor 


|AU146707HEMBB1 Homo sapiens cDNA clone HEMBB1001253 3' | 


iHuman dystrophin gene | 


Human dystrophin gene | 


qh98e1 1 .x1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone IMAGE:1855052 3' similar to contains 1 
MER3.b2 MER3 repetitive element ; 


|Homo sapiens chromosome 21 segment HS21G003 | 


i 

.§ 

! 

1 

1 


zu39b05.s1 Scares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:740337 3' similar to contains Alul 
repetitive element; | 


Human germline T-cell receptor beta chiain TCRBV17S1A1T, TCRBV2S1 , TCRBV10S1P, TCRBV29S1P, 
TCRBV19S1P, TCRBV15S1, TCRBV11S1A1T, HVB relic, TCRBV28S1P, TCRBV34S1 , TCRBV14S1 , 
TCRBV3S1. TCRBV4S1A1T, TRY4, TRY5, TRY6. TRY7, TRY8, TCRBD1, TCRBJ1S1, TCRBJ1S2,> 


am58c09j<1 Johnston frontal cortex Homo sapiens cDNA clone IMAGE:1539750 3' | 


f 

i 

X 

i 

i 

i 

i 

i 


H uman tyrosine kinase TXK (txk) gene, exono 9 and 1 0 | 


EST390550 MAGE resequences, MAGP Homo sapiens cDNA | 




i 

S 
i 

1 
1 

I 

1 

1 

1 
f 

i 


|yu01e11.r1 Soares_pineal _gland N3HPG Homo sapiens cDNA clone IMAGE.232556 5' | 


< 
1 

1 

1 

1 
§ 
1 

1 


1 Danio rerio hagoromo gene, exons 1 to 6, partial cds | 


IAV654352 GLC Homo sapiens cDNA clone GLCDUH10 3' | 


1 

i 

< 

1 

1 
I 

a 
0 

1 


i 
§ 

i 
1 

1 

i 

! 
i 

1 


LINE-1 REVERSE TRAIMSCRIPTASE HOMOLOG 

MYOMESIN 2 (M-PROTEIN) (165 KD TITIN-ASSOCIATED PROTEIN) (165 KD CONNECTII^- 
ASSOCIATED PROTEIN) 


Solanum lyocpersioum phytochrome F (PHYF) gene, partial ods 

Solanum lyocpersioum phytochrome F (PHYF) gene, partial ods | 


Homo sapiens DNA, DLEC1 to 0RCTL4 gene region, section 1/2 (DLEC1 , 0RCTL3, 0RCTL4 genes, 1 
complete cds) | 


Homo sapiens DNA, DLEC1 to ORCTL4 gene region, section 1/2 (DLEC1, ORCTL3, 0RCTL4 genes, | 
Icomplete cds) | 


Top Hit 
Database 


|EST HUNMN | 


z 




EST HUMAN J 






|est_human 1 




|EST HUMAN | 








z 


[EST HUMAN 1 






z 


lEST HUMAN 1 


|EST HUMAN | 


[EST HUMAN J 


SWISSPROT 




z 




TopHitAcession 
No. 


iAU146707.1 1 


M86524.1 \ 




'aI286021.1 1 


IAL163203.2 | 


IAF224268.1 | 


IAA478980.1 | 




i 
< 


|BE082317.1 1 


,U34374.1 1 


i 

i 

< 


1U01029.1 1 


H96265.1 1 


IH96265.1 1 


,U09226.1 1 


AB037997.1 | 


IAV6543521 | 


i 
§ 
< 


AU121712.1 1 


1 1 

§ S 


1U32444.2 
U32444.2 1 


1 


i 

< 


IWost Similar 
(Top) Hit 
BLAST E 
Value 


1 2.0E-04I 


1 2,0E-04| 




S 


s 




1 2.OE-O4I 






1 20E-04| 




1 2.0E-04I 


1 2.0E-04I 




s 


1 2,0E-04| 


1 2.0E-04I 


1 2.0E-04I 




3 


3 3 


3 3 


1 2.OE-O4I 


3 


Expressfon 
Signal 




f 


















§ 




























ORF SEQ 
ID NO: 


1 202321 


i 


§ 


















s 










24630] 










26708 
I 267091 




i 














i 


i 


1 










1 


1 


1 




i 


S 








16518 


1 


1 




















































Probe 
SEQ ID 
NO: 


1 


i 


i 




1 






1 




i 




1 




1 


1 


1 


1 




s 


i 


1 1 
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Table 4 

bes Expressed In Heart 


Top Hit Descriptor 


i 
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i s X ^ ^ 
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^ <i ^ ^ 

i "J 2 i 

^ ^ - & ^ o S 
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tl 

1 fi 

if If 
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ijl! 

^ & 1 2 

an 

; & 1 1 1 

1 1 1 1 i 

lili 


|| 

si f- 

I i 8 S 

II Jl 

II III 

ii ili 
Ii lii 

o °: ^. 1 i 

ft fjj 

8 o « o § § 

I I I i 1 1 

llllll 


si 

if 

\ 1% 

mi 

- S S o ? 
lii 2 S ?! 

1111 . < 

'lill ill 

.1 g z g 2 g 1 

g- §• LU i .2 X & 

iiii fill 

^-^,§1 Hi! 

ills ^i^i 
Iflllllll 


Page 

igle Exon Pro 


4^ 


Z 2 Z 
III 


ii i iii 

? i?' 


z z 


— 


lis i is 


W 


1 

8- 


AF020503.1 
BE149303.1 
AA405777.1 
AV730373.1 
AJ243213.1 


Ii i ill 


i f 
"=> < 


IIII 
< < < 


^ i 

lis 

m ilii 




pP 


iiili 


ii 1 III 




ssss 

o p p q 


1.0E-04 ! 
1.0E-04 ( 

1.0E-04/ 
1.0E-04 N 
1.0E-04 P 
1.0E-04 F 
1.0E-04 




g 

'g g 
1" 


^ ^ p: ? § 






q ^ s 


? ^ 9 ^ S p M S 




a 

!n o 
u 2: 
a: S 
o 


iiii 


a § s ? IS [g 
s s ^ 1 s 1 
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^ii iiili 




lii 


iiili 


ii i ill 
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S S S S ¥2 


ii ^ ill 
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Top Hit Descriptor 


1 Homo sapiens KIAA0237 gene product (KIAA0237), mRNA | 






1 LINE-1 REVERSE TRANSCRIPTASE HOMOLOG | 


qv57d10.x1 NCI CGAP Ov32 Homo sapiens cDNA clone IMAGE:1 985683 3' | 


jqv57d10.x1 NCI CGAP_Ov32 Homo sapiens cDNA clone IMAGE:1 985683 3' | 


|wf26e08.x1 Soares NFL T GBC SI Homo sapiens cDNA clone !MAGE:2356742 3' | 


1 CYSTATIN-RELATED EPIDIDYMAL SPERMATOGENIC PROTEIN PRECURSOR (CYSTATIN 8) 1 




1 LINE-1 REVERSE TRANSCRIPTASE HOMOLOG | 


Mouse alptia leukocyte interferon gene, complete cds | 


Homo sapiens mRNA for KIAA1 1 42 protein, partial cds | 


|CM0-CT0404-130700-475-h03 CT0404 Homo sapiens cDNA | 




ati45c11.s1 SoaresJestis_NHTHomosapienscDNA clone 1292468 3' | 


|wi54c1 1 ,x1 NCI CGAP Co16 Homo sapiens cDNA clone IMAGE:2394068 3' similar to contains MER6.t1 1 
MER3 repetitive element ; | 


1 PROLYL 4-HYDRdXYLASEALPHA-2 SUBUNIT PRECURSOR | 




IHomo sapiens methyl-CpG binding protein 1 (MBD1) gene, exon 15b 1 


xa34g05,x1 NCI_CGAP_Br1 8 Homo sapiens cDNA clone IMAGE:2568728 3' similar to contains LI .12 LI 1 
repetitive element ; 1 


qv23f06.x1 NCI_CGAP_LvmS Homo sapiens cDNA clone IMAGE;1 982435 3' similar to contains element 1 
MIR repetitive element; | 


1 PROLYL 4-HYDROXYLASE ALPHA-2 SUBUNIT PRECURSOR | 


Homo sapiens MSH55 gene, partial cds; and CLICI, DDAH, G6b, G6c, G5b, G6d, G6e, G6f, BATS, G5b, 
C3K2B, BAT4, G4, Apo M, BAT3, BAT2, AIF-1, 107. L3T-1, LTB, TNF, and LTA genes, complete cds 


Pisum sativum mRNA for beta-1 ,3 gluoanase (gns2 gene) | 


Pisum sativum mRNA for befa-1 ,3 gluoanase (gns2 gene) | 




HSPD1 6734 HM3 Homo sapiens cDNA clone s3000a03H04 1 




1 Top Hit 1 


Database 
Source 


iNT 1 


|EST HUMAN | 




ISWISSPROT 1 


lEST HUMAN | 


lEST HUMAN | 


|EST HUMAN 1 


ISWISSPROT 1 


[NT 1 


ISWISSPROT 1 


iNT 1 


NT 


|EST HUMAN | 


[EST HUMAN | 


Iest_human 1 


EST HUMAN | 


ISWISSPROT 1 


NT 


NT _ 


EST HUMAN | 


1 


ISWISSPROT 1 


NT 


iNT 1 


iNT 1 


iNT .1 


|EST HUMAN 1 


[NT 1 


1 

I 

X 

§ 


No 




IAI357156.1 1 


iZ72560.1 1 


|P0a547 1 


|AI251980.1 1 


|AI251 980.1 1 


|AI806220.1 1 


O 


i 


i 


IM28587.1 1 


< 


s 
i 


|AW893325.1 | 


1AA718933.1 1 


i 

< 


1Q60716 1 


5 
§ 
S 


s 
< 


AW073078.1 1 


AI287878.1 | 


IQ60716 1 


AF1 29758.1 


|AJ251546,1 1 


1AJ251646,1 1 


IM83575.1 1 


|F28172.1 1 


i 

s 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1. OE-04 


I.OE-041 


1. OE-04 


I.OE-041 


1 .OE-04 


I.OE-04 


I.OE-041 


1 .OE-04 1 


l.0E-04| 


1. OE-04 


1 .OE-04 1 


I.OE-041 


9 


1 .OE-04 1 


1 9.0E-05' 


9.0E-05 


1 9.0E-05I 


S 


1 9.0E-05 


9 


9.0E-05 


1 9.0E-05I 


9 


1 8.0E-05I 


1 8.0E-05I 


1 8.0E-05I 


1 8.0E-05I 


1 8.0E-05I 






























1 


a 






d 








2.53| 




1.75 




2.171 












1.471 




8 










° 






1.15 




C 

o 


ID NO: 


1 24672| 


1 






1 


1 


1 274551 




27628 1 






290121 






i 


23678| 


25632 1 






1 


287051 


25632| 




20576| 








i 


Exon 
SEQ ID 
NO; 


14901 


14905 


15015 


15487 






17249 


1 


17413 


1 


§ 


18721 


i 


s 




13902 


15543 




i 


18352 


S 


15543 


1 


10733 


10773 


s 


g 


i 


























































Probe 
SEQ ID 
NO: 


1 


1 5033 


5148! 


1 5572 


i 


1 


s 


1 73841 


1 75621 


1 7848| 


1 


1 


1 9035 1 


i 


i 


i 


i 


i 


1 74571 


1 8479! 










1 


1 2920| 


i 


1 
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Top Hit Descriptor 


zs88h01 .s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:704593 3' similar to contains Alu 1 


RC3-CT0208-2209a9-011-E04 CT0208 Homo sapiens cDNA | 


RC3-CT0208-220999-011-E04 CT0208 Homo sapiens cDNA | 


|HUM072014F Human fovea cDNA Homo sapiens cDNA clone EST HFD072014 | 


IHUM072014F Human fovea cDNA Homo sapiens cDNA clone EST HFD072014 | 


PROBABLE GLYCEROL-3-PHOSPHATE ACYLTRANSFERASE, MITOCHONDRIAL PRECURSOR 
l(GPAT) 1 


Homo sapiens chromosome 21 segment HS21C078 | 




Homo sapiens chromosome 21 segment HS21C001 | 


Caenoriiabditis elegans Skp1 p homolog mRNA, complsle cds | 


Rat cytomegalovirus Maastricht, complete genome | 


IEST78713 Placenta 1 Homo sapiens cDNA | 


EST04984 Fetal brain, Stratagene {cat#93e206) Homo sapiens cDNA clone HFBED60 | 


Homo sapiens sarcogii^an, epsilon (SGCE), mRNA 


Homo sapiens chromosome X open reading frame 6 (CX0RF6) mRNA | 


Homo sapiens chromosome X open reading frame 6 (CXORF6) mRNA | 


Iwb54h06 x1 NCI CGAP_GC6 Homo sapiens cDNA clone IMAGE:2309531 3' similar to gb:J03250 DNA 1 
TOPOISOMERASE 1 (HUMAN); | 


1 

§ 

1 

1 
± 

1 
1 

i 


H.sapiens flow-sorted chromosome 6 HIndlll fragment, SC6pA28B1 0 | 




ICONTACTIN PRECURSOR (GLYCOPROTEIN GP135) | 


CONTACTIN PRECURSOR (GLYCOPROTEIN GP135) 


|yv50g1 1 rl Scares fetal liver spleen 1 NFLS Homo sapiens oDNA clone 1MAGE:246212 S \ 


I 
t 

i 
§ 

9 
§ 


[C4B-BINDING PROTEIN PRECURSOR (C4BP) | 


1 
O 

o 
z 

i 

5 
z 


|ye28o12.r1 Stratagene lung (#937210) Homo sapiens cDNA clone IMAGE:119062 5' | 


|yi5Sd08.s1 Scares placenta Nb2HP Homo sapiens cDNA clone IMAGE:143535 3' similar to contains Alu 1 


|Zk58f02.r1 Soares_pregnant_uterus_NbHPU Homo sapiens cDNA clone IMAGE:487035 5' 


< 

t 

1 
I 

§ 

i 

§ 
2 


Top Hit 
DatalJase 
Source 


EST HUMAN | 


|EST HUMAN | 


|EST HUMAN | 


[EST HUlVIAN 1 


|EST_HUMAN | 


SWISSPROT 1 


iNT 1 


NT_ J 


iNT 1 


iNT 1 


iNT 1 


lEST HUMAN | 


|EST HUMAN ] 


NT 


iNT 1 


NT J 


EST HUMAN | 


iNT 1 


iNT 1 


iNT 1 


ISWISSPROT 1 


iSWISSPROT 1 


lEST HUMAN | 


|EST HUMAN | 


o 

IT 
1 


ISWISSPROT 1 


jEST HUMAN | 


[est human 1 


|EST HUMAN 


'EST_HUMAN | 


TopHitAcession 
No. 


|aA279333.1 


|AW847445.1 | 


|AW847445.1 


|L49075.1 1 




|q22949 1 


|AL1 63278.2 | 




IAL163201.2 1 




8 


1 
1 


IT07095.1 ] 


] 108350461 


] 4885170] 


1 


< 


]Z84505.1 ] 




i 

s 


i 

a 


1 

a 


1N72829.1 ] 








]T94149.1 1 


1 


AA044015.1 


]AW890110.1 1 


Most Similar 
(Top) Hit 
BI-ASTE 
Value 


1 8.0E-O& 


1 7.0E-05j 


1 7.0E-O5I 


1 7.0E^5| 


1 7.0E-05| 


7.0E-05 


1 7.0E^5| 


1 7.0E-O5I 


1 7.0E-05I 


1 7.0E^5| 


1 7.0E-05| 




] 7.0E-05I 


] 7.0E-05] 


s 


1 6.0E-05] 


S 


] 6.0E-05] 


9 


1 6.0E-05I 


9 


] 6.0E-05I 


] 6.0E^5] 


9 


] 6.0E^5| 


] 6.0E-05] 


] 5.0E-05I 


e.oE-osI 


6.0E-05 


s 


Expression 
Signal 














1 5.241 






s 






1 3.04] 










§ 




!? 






? 






1 1.37] 


^ 








ORF SEQ 
ID NO: 








i 






1 224401 


228431 


s 


24023] 


24508] 




27505] 










1 22409] 


22410] 


204151 


i 


i 


25879] 


i 


1 


27438] 


27558] 


28267I 




25073] 


Exon 
SEQID 
NO: 


1 










10957 


12551 


13046 




14241 


14723 


15065 


17296 


18374 








12519 


12519 


i 










17234 


17234 


17354 






19587| 






























































Probe 
SEQID 
NO: 


1 9911 1 


344 






s 


,J 


1 


3121 


1 


4344 


1 4842| 


5202 


1 




1981 


i 




i 


2652 


1 2783 1 


i 


i 


& 


1 


7330 


7330 


7484 


.3 


1 


i 
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Table 4 

Single Exon Probes Expressed in Heart 


: 

ii 

11 

ii 

f 

0 w 

P 

1 ^ 

II 

It 


MER3.b2 MER3 repefifive element ; 

Human adenosine deaminase (ADA) gene, complete cds ^ 

zq4Sa12.r1 Stratagene hNT neuron (#937233) Homo sapiens cDNA clone IMAGE:632734 ff similar to 
contains Alu repetitive element;contains element L1 repetitive element : 

RC3-BT0319-120200-014-h08 BT031 9 Homo sapiens cDNA 

Homo sapiens p47-phox (NCF1) gene, complete cds - 
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.i| 

!1 

i i 

11 
1 i 

ii 

1 

If 
?i 

1 1 

11 


5 

q: 
3 

1 

i 

< i 

i i 

t i 
i i 

ii 

|i| 
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Top Hit Descriptor 


xa89a03,x1 NCLCGAP_Co1 7 Homo sapiens oDNA clone IMAGE:2S73932 3' similar to contains L1 .b3 L1 
repetitive element ; | 


[Homo sapiens ABCA1 (ABCA1) gene, complete cds | 


1 AU131 51 3 NT2RP3 Homo sapiens cDNA clone NT2RP3002707 5' | 






1 MOSAIC PROTEIN LGN | 


|Homo sapiens chromosome21 segment HS21 0003 | 


i 

i 
1 
i 
i 

Si 

E 
I 

1 
1 


xy49g1 1 x1 NC;_CGAP_Lu34.1 Homo sapiens cDNA clone IMAGE:2856548 3' 1 


Homo sapiens chromosome 21 segment HS21C046 ) 


Homo sapiens Spast gene for spastin protein | 


|ns19g02s1 NCI CGAP_GCB1 Homo sapiens cDNA done 1MAGE:1 1841 14 3' similar to contains Ll.t1 Ll 1 
L1 repetitive element; 


|Homo sapiens phospholipase A2. group X(PLA2G10) mRNA, and translated products | 


|52 KD RO PROTEIN (SJOGREN SYNDROME TYPE A ANTIGEN (SS-A)) (RO(SS-A)) | 




|zx35h12.s1 Scares total fetus Nb2HF8 9w Homo sapiens cDNA clone IMAGE:788519 3' similar to 1 
gb:L02932 PEROXISOME PROLI FERATOR ACTIVATED RECEPTOR ALPHA (HU MAN); | 


zsOSel 1 n NCI_CGAP_GCB 1 Homo sapiens cDNA clone IMAGE:684332 5' similar to contains Alu 1 
repetitive element;contains element TAR1 repetitive element ; 


]UI-H-BI2-agk-a-08-0-Ul.s1 NCI_CGAP_Sub4 Homo sapiens cDNA done IMAGE:2724398 3' | 


1 
< 
§ 

t 

1' 
0 

i 

t 
5 


ha07c10.x1 NCI„GGAP_Kid12 Homo sapiens oDNA done IMAGE:2873010 3' similar to contains Ll .t2 Ll 1 
[repetitive element ', | 


(HLA-H) gene, RoRet gene, and sodium phosphate transporter (NPT3) gene, complete cds 


(HLA-H) gene, RoRet gene, and sodium phosphate transporter (NPT3) gene, complete cds 


|tt73a06.x1 NCI_CGAP_HSC3 Homo sapiens cDNA done 1MAGE:2245386 3' 


qg11b08jd Soare5_plaoenta 8to9weeks 2NbHP8to9W Homo sapiens cDNA clone 1MAGE:1759191 3' | 


Top Hit 
Database 
Source 


1 


INT 1 


lEST HUMAN | 


\m _ J 


INT 1 


ISWISSPROT 1 


INT 1 


1 


lEST HUMAN 1 


iNT 1 


NT 1 


EST HUMAN 


iNT 1 


;SWISSPROT 1 


INT 1 


EST HUMAN | 


1 

EST HUMAN 


IEST_HUMAN 1 


,EST HUMAN | 


I 


NT 


NT 


EST HUMAN 1 


EST HUMAN | 


TopHitAcession 
No. 


AW074604.1 1 


IAF275948.1 | 


|AU131 513.1 1 


< 


i 

i 
< 


1P81274 1 


i 




|AW419134.1 J 


i 
i 


IAJ246003.1 1 


i 
1 


45058441 


1 PI 9474 1 


1AL163227.2 | 


AA452578.1 | 




1 
< 


IAW291521.1 1 


1 


U91328.1 


U91328.1 


AI583811.1 1 


1 

AI21 8983.1 1 


l\/1ost Similar 
(Top) Hit 
BLAST E 


1 20E-05 


1 2.0E-051 


1 2.0E-051 


3 




CE-05 


CE-OSi 


OE-05 


OE-05 




OE-05 


OE-05 


CE-05' 


OE-05 


OE-05 




9 


CE-05 


OE-05 


1 I.OE-Os] 


1. OE-05 


1. OE-05 


] 9.0E-06| 


1 9.CE-06I 


































Expression 
Signal 


3.73 


1 2.251 






















1 6.451 




2.13] 




















ORFSEQ 
ID NO: 








1 


1 23301 1 




1 23761 


i 


24413] 




S 




§ 






27312 




27778] 


27779] 




i 


i 




i 


SEQ ID 
NO: 


1 19577 


1 19503| 




1 






1 139841 


1 14080| 


146271 






i 


16079| 


16430| 


17045] 




17220 


17554] 


17554] 




18170 


18170 


I 12504] 


i 


Probe 
SEQ ID 
NO: 


9442I 




9637 


1 


1 


3886 


4082 


4180 


4742 


1 4882 1 


i 




1 6213| 


1 65721 


§ 


1 7240 


7352 


7704 


7704 


rJ 


i 


i 


i 
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Top Hit Descriptor 






ox20g01.x1 Soares_fetalJiver_spleen_1NFLS_S1 Homo sapiens cDNA clone IMAGE:1656912 3' similar to 1 
contains Alu repetitive element; 




SUSHI REPEAT-CONTAINING PROTEIN SRPX PRECURSOR (DRS PROTEIN) (DOWN-REGULATED 1 
BY V-SRC) 1 


SUSHI REPEAT-CONTAINING PROTEIN SRPX PRECURSOR (DRS PROTEIN) (DOWN-REGULATED 

BY V-SRC) 1 




PUTATIVE SERINE/THREONINE-PROTEIN KINASE C22E12.14C | 


RC3-CT0283-201199-011-h11 CT0283 Homo sapiens oDNA | 


ANKYRIN 1 (ERYTHROCYTE ANKYRIN) 


ANKYRIN 1 (ERYTHROCYTE ANKYRIN) | 


ab90f1 0.s1 Stratagene lung (#937210) Homo sapiens cDNA clone IMAGE:854251 3' similar to contains 1 

MER20.t1 MER20 repetitive element ; 


Homo sapiens KIAA0555 gene product (KIAA0555), mRNA | 


qw16g09.x1 NCLCGAP_Ut3 Homo sapiens cDNA clone IMAGE:1991296 3' similar to contains Alu repetitivej 


EST99205 Thyroid Homo sapiens cDNA 5' end similar to EST containing LI repeat | 




601881522F1 NIH MGC 57 Home sapiens cDNA clone 1MAGE:4093972 5' | 


QV3-BT0379-01 0300-1 05-d1 1 BT0379 Homo sapiens cDNA 1 


QV3-BT0379-C10300-105-d1 1 BTC379 Homo sapiens cDNA | 


OVARIAN ABU NDANT MESSAGE PROTEIN (OAM PROTEIN) 


ox08e02.x1 Soares_fetalJiver_spleen 1NFLS SI Homo sapiens cDNA clone IMAGE: 1655738 3' similar to 1 
contains MER8.t2 MER8 repetitive element ; | 


Mus musculus E-oadherin binding protein E7 mRNA, complete cds | 


IL5-UM0070-1 1 0400-063-902 UM0070 Homo sapiens cDNA | 








EST185496 Colon carcinoma (HOC) cell line Homo sapiens cDNA 5' end | 


Top Hit 
Database 
Source 






EST HUMAN 


z 


SWISSPROT 


SWISSPROT 


z 


O 

g 


EST HUMAN | 


SWISSPROT 1 


SWISSPROT 1 


EST HUMAN 


z 


EST HUMAN | 


EST HUMAN | 


1 


EST HUMAN | 


EST HUMAN 1 


EST HUMAN | 


SWISSPROT 1 


EST HUMAN | 




EST HUMAN | 








EST HUMAN | 


TopHitAcessron 
No. 


i 


i 


i 

1 


AL1 63209.2 | 


1 
a 


a 


U35114.1 1 


Q10354 1 


1 


Q02357 1 


Q02357 1 


i 
i 


1 




AA385542.1 | 


AW883141.1 1 


BF21 5972.1 | 


i 


i 

I 


Q01456 J 


i 


AF1 67441.1 1 


AW801912.1 1 




AL1 63246.2 | 


U07561.1 J 


i 


Most Similar 
(Top) Hit 
BIASTE 
Value 






9.0E-06 


S 


9.0E-06 


i 

ci 


9.0E-06I 


S 


8.0E-06I 


8.0E.06| 




7.0E-06I 






7.0E-05I 


7.0E-05I 


7.0E-06I 




6.0E-05I 


6.0E-05| 


aoE-oel 


6.0E-06I 


S 


6.0E-06I 


5.0E-06| 


S 


5.0E-06I 
































s 


f2 




§ 


s 


P:. 












! 




ORFSEQ 
ID NO: 




i 


i 








i 


1 


















24900| 






1 


i 


1 






25705| 


25843 


1 


Exon 
SEQ ID 
NO: 


13473| 






i 


§ 


i 








1 






1132S| 


.J 






i 




i 








17520 


19377] 


§ 






Probe 
SEQ ID 
NO; 


i 


i 




i 




.J 






2483 


1 


i 






1 2844I 


3516 




9072 


2887 


i 


1 




i 


R 


8 


i 


.J 


781 7| 
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Top Hit Descriptor 


Iqv23f06j<1 NCLCGAP_Lvm6 Homo sapiens cDNA clone IMAGE;1982435 3' similar to contains element 1 






lzo1 7e08.r1 Stratagene colon (#937204) Homo sapiens cDNA clone IMAGE:587174 5' 1 


lz)(D4d1 1 si Scares totaljetus Nb2HF8 9w Homo sapiens cDNA clone IIWAGE:785493 3' similar to 1 
|gb.D26129 RIBONUCLEASE PANCREATIC PRECURSOR (HUMAN); | 




RC4-NT0054-120500-O12-b03 NT0054 Homo sapiens cDNA | 


[Homo sapiens ADP/ATP carrier protein (ANT-2) gene, complete cds 1 


Homo sapiens p47-pliox(NCF-;) gene, complete cds | 


[Homo sapiens p47-phax(NCF1) gene, complete cds 1 


1 Homo sapiens glypican 3 (GPC3) gene, partial cds and flanking repeat regions | 


1 Homo sapiens glypican 3 (GPC3) gene, partial cds and flanl^ing repeat regions | 


1 Homo sapiens chromosome 21 segment HS21 C081 | 


1 

i 
§ 

1 

1 

1 

i 

i 
1 


1 

i 

1 
i 

1 

1 

1 

g 


IPOL POLYPROTEIN [CONTAINS: PROTEASE ; REVERSE TRANSCRIPTASE ; ENDONUCLEASE] 1 




EST05560 Fetal brain, Stratagene (cat#935206) Homo sapiens cDNA. clone HFBEN89 


1 Homo sapiens chromosome 21 segment HS21 C080 | 


Homo sapiens membrane interleukin 1 receptor accessory protein (ILIRAP) gene, exons 10 and 11 | 


CM3-CT0277-221 099-024-e1 1 CT0277 Homo sapiens cDNA 


Homo sapiens HLA class III region containing tenascin X (tenascin-X) gene, partial cds; cytochrome P450 21- 
(Bf), and complement component C2 (02) genes.> 


IHYPOTHETICAL 24.1 KD PROTEIN IN LEF4-P33 INTERGENIC REGION | 


|7g94f07.x1 NCI CGAP Co16 Homo sapiens cDNA clone IMAGE:3314149 3' similar to TR:O75920 075920 1 
4F5L. ; 1 


|CM4-NN1029-250300-121-h12 NN1029 Homo sapiens cDNA 1 


1 

i 
1 

z 

I 

§ 

o 
z 


1 EST9361 5 Supt cells Homo sapiens cDNA 5' end | 


|wh54f1 0.x1 NCLCGAP_Kid1 1 Homo sapiens cDNA clone IMAGE:2385547 3' | 


Top Hit 

DatalDase 
Source 


|eST HUMAN 


|NT 1 


|NT 1 


|EST HUMAN | 


|est human 


iNT 1 


|EST HUMAN ] 


|NT 1 


iNT 1 




iNT 1 


INT -1 


iNT 1 


[EST HUMAN 1 


|EST HUMAN j 


[SWISSPROT 1 




|EST_HUMAN | 


iNT 1 




lEST HUMAN 1 


NT 


1 SWISSPROT 1 


EST HUMAN | 


lEST HUMAN 1 


lEST_HUMAN 1 


[EST HUMAN i 


|EST HUMAN 1 


TopHltAcession 
No. 


i 

5 


i 

i 


B 


|AA132611.1 1 


AA449257.1 


IAL163203.2 | 


i 
i 


IL78810.1 1 


1 


|AF184614.1 1 


|AF0a3529.1 1 


IAF003529.1 | 


IAL163281.2 | 


i 
< 




1P21414 1 


IAF135416.1 1 


IT07770.1 1 


|ALie3280.2 1 


AF167341.1 1 


i 


AF019413.1 


tP41479 1 


BF001867.1 1 


IAW903222.1 1 


i 
1 




i 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1. OE-06 


l.0i^-06| 


1. OE-06 


l.0E-06| 


1. OE-06 


1. OE-06 


l.0E-06| 


1 .OE-Oel 


i.0E-06| 


i.0E-06| 


9.0E^7| 


9.0E-07I 


9.0E-07I 


8.0E-07| 


8.0E-07I 




s 


9 
S 




7.0E-07I 


6.0E-07 


6.0E-07 


9 


9 


6.0E-07| 




























Expression 
Signal 










3.J 












































2.9 




ORFSEQ 
ID NO: 


27013] 


27647| 












i 


21 729 1 


21 730 1 




201351 


28708 1 


243361 


24337 1 










215991 


21641 1 


22217 




27351 1 










Exon 
SEQ ID 
NO: 




17433 




1 


17488 


17839 


18720 


1 


1 


1 




10314 


1 


14547 


14547 


i 


i 


18703 


1 


11719 


1 




1 


17156 


S 
g 


i 


s 
g 


?i 


























































Probe 
SEQ ID 
NO: 


i 


1 75B2| 


1 7582] 




1 




1 8912| 


1 94401 


1 




1 357| 




1 


1 


1 


1 




1 


S 
8 


,.J 


1 18701 


2442 


1 


1 


s 




1 1041J 


1 
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Top Hit Descriptor 


j PROTEIN-ARGININE DEIMINASE TYPE IV (PEPTIDYLARGININE DEIMINASE IV)(PAD-R4) 1 
l{PEPTIDYURGINiNE DEIMINASE TYPE ALPHA) | 


oc04o10. Si NCi CGAP GOBI Homo sapiens cDNA clone iMAGE;1339890 3' 1 




Rattus norvegicus mRNA for 45 kDa secretory protein, partial | 




Homo sapiens DiGeorge syndrome critical region, telomeric end | 


Homo sapiens DiGeorge syndrome criticai region, telomeric end | 


Fugu rubripes beta-cytoplasmic(vascular) actin gene, complete cds | 


Homo sapiens homeobox protein CDX4 (CDX4) gene, complete cds and flanking repeat regions | 


Homo sapiens homeobox protein CDX4 (CDX4) gene, complete cds and flanking repeat regions 1 


zrf)8b07.s1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA clone 1MAGE:650869 3' similar 
to gb:L31860 GLYCOPHORIN A PRECURSOR (HUMAN);contains Alu repetitive element; 


yol Sg04.s1 Stratagene lung (#937210) Homo sapiens cDNA clone IMAGE:80790 3' similar to contains LI 1 

repetifive element ; | 


1/6 AUTOANTIGEN 1 


HYPOTHETICAL 7Z5 KD PROTEIN C2F7.10 IN CHROMOSOME 1 | 


Homo sapiens caveolin 1 (CAV1) gene, exon 3 and partial cds 1 


|xa05h07.x1 Scares NFL T GBC SI Homo sapiens cDNA clone 1MAGE:2567485 3' similar to WP:C38H2.1| 
CE00923 PROBABLE RABGAP DOMAINS ; ) 


|xa05h07.x1 Scares NFL T GBC SI Homo sapiens cDNA clone IMAGE;2567485 3' similar to WP:C38H2.1 1 
|CE00923 PROBABLE RABGAP DOMAINS ; 


Homo sapiens chromosome 21 segment HS21C101 1 


RC3-NN0066-260400-021-g11 NN0056 Homo sapiens cDNA | 


i 

i 
1 

I 
§ 

X 

i, 
1 

1 

i 


IAV729390 HTC Homo sapiens cDNA clone HTCAEG02 5' 


Homo sapiens chromosome 21 segment HS21C103 | 


CM4-NN0003-280300-1 24-e06 NN0003 Homo sapiens cDNA | 


jZnSSlill Jt5 Stratagene lung carcinoma 937218 Homo sapiens cDNA clone IMAGE:565029 3' similar to 1 
Icontains THR.b2 THR repetitive element ; 


Homo sapiens chromosome 21 segment HS21C082 | 


Top Hit 
Database 
Source 


SWISSPROT 1 


EST HUMAN 1 


i w 
















EST HUMAN 


EST HUMAN | 


[SWISSPROT 1 


ISWISSPROT 1 




EST HUMAN | 


|EST HUMAN | 


H 


|EST HUMAN | 


|=ST_HUMAN 1 


|EST_HUMAN J 




lEST HUMAN | 


EST HUMAN 




TopHitAoession 
No. 






AF029308.1 


IAJ132352.1 1 


i 

< 


IL77569.1 1 


IL77569.1 1 


1 


i 

< 


1AF003530.1 1 


AA223260.1 


i 




IQ09701 1 


IAF125348.1 1 


i 

1 


1 
1 


IAL163301.2 1 


g 
1 
1 


IAI20S715.1 1 


IAV729390.1 1 


IAL153303.2 | 


IAW892507.1 | 


AI732462.1 


|AL1 63282.2 | 


Most Similar 
(Top) Hit 
BU\STE 
Value 




1 3.0E-07 


3.0E-07 


1 3.0E-07| 


1 2.0E-07I 


I 2.0E-07I 


1 2.0E-07I 


1 2.0E-07| 


3 


1 2.0E-07| 


2.0E-07 


s 


1 2.0E-071 


1 2.0E-071 




2.OE-O7I 




2.0E-07 


2.0E-07I 


i 


1 2.0E-07I 


1 2.0E-07I 


1 2.0E-07I 




1.0E-07| 


i 1 










s 
















^ 










d 






s 




^ 






ORF SEQ 
ID NO: 


25462 




1 




i 




19941 1 


i 




1 


20701 


20702 1 


i 


1 


233341 


24723 1 


24724! 






i 






27984] 






Exon 
SEQID 
NO; 


i 


159301 


1 1 


1 


10013| 


10122| 


^ 


101461 


i 


i 


10853 


§ 


1 


1 


135471 


1 


i 


i 






S 


17478| 


17741 1 


19557 




Probe 
SEQID 
NO: 


1 5476| 




I i 


1 






9 










a 


1 11471 


1582 


1 36331 


1 5078 1 


1 50781 


1 


1 52771 




i 


s 


7891 


s 


1 
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5 

E 
1 


1 

i 
1 

z 

1 

z 

< 


1 Rat mRNA for ribosomal protein L31 | 


DYNEIN HEAVY CHAIN (DYHC) 


1 
Z 
< 

5 

1 


|cong3.P11.A5 conorm Homo sapiens cDNA 3' | 


[Raftus norvegicus Munci 3-1 mRNA, complete cds 


IDYNEIN HEAVY CHAIN (DYHC) | 


iDYNEIN HEAVY CHAIN (DYHC) j 


IHomo sapiens SCL gene locus | 




iHomo sapiens chromosome 21 segment HS21C048 | 


MR0-HT0166-191199-004-g09HT0166 Homo sapiens cDNA | 


Homo sapiens chromosome 21 segment HS21 C048 | 


IRETROVIRUS-RELATED POL POLYPROTEIN [CONTAINS: REVERSE TRANSCRIPTASE ; 1 
lENDONUCLEASE] | 


Homo sapiens chromosome 21 segment HS21C009 | 




,nh03b09.s1 NCI_CGAPJ"hy1 Homo sapiens cDNA clone 1MAGE:9431 93 similar to contains Alu repetitive 1 

lelement 


jALDEHYDE OXIDASE | 


COMPLEMENT C2 PRECURSOR (C3/C5 CONVERTASE) | 


|QV0-CT0225-131099-034-a12 CT0225 Homo sapiens oDNA | 


DORSAL-VENTRAL PATTERNING TOLLOID PROTEIN PRECURSOR | 


[DORSAL-VENTRAL PATTERNING TOLLOID PROTEIN PRECURSOR j 


f 
1 

< 
E 

1 

1 
O 


an22d10.x1 Gessler Wilms tumor Homo sapiens cDNA clone IMAGE;1699411 3' similar to contains Alu 1 
repetitive element;contains element MER22 repetitive element ; 


Homo sapiens mRNA for UGA suppressor tRNA-associated antigenic protein (tRNA48 gene) 1 


'602248024F1 NIH_MGC_62 Homo sapiens cDNA clc^e IMAGE:4333300 5' | 


I602248024F1 NIH MGC 62 Homo sapiens cDNA clone IMAGE:4333300 5' | 


zri65g03.rl SDares_fetal_hBart_NbHH19W Homo sapiens cDNA clone IMAGE:345556 5' similar to contains 1 
L1-t1 LI nepeHtive element; 


tb95a1 1 .x1 NCI_CGAP_Co1 6 Homo sapiens cDNA clone IMAGE:2062076 3' similar to contains MER1 8.b3 1 
MER18MER13repeti«veelement; | 


Top Hit 
Database 
Source 


ISWISSPROT 1 




ISWISSPROT J 


ISWISSPROT 1 


|EST_HUMAN | 




ISWISSPROT 1 


ISWISSPROT 1 








1 


Z 


SWISSPROT 1 


z 




EST HUMAN 


ISWISSPROT 1 


ISWISSPROT 1 


|EST_HUMAN | 


ISWISSPROT 1 


ISWISSPROT 1 




1 


z 




|EST HUMAN j 


EST HUMAN 




TopHitAcession 


i 


i 

o 




IP15305 1 


IP15305 


|AI535743.1 J 




IP15305 1 


1 


131016.1 1 


i 


163248.2 1 


i 


i 

8 


8 






i 


IQ06278 1 


1 


IAW851878.1 1 


IP25723 1 




IL42571.1 1 


§ 
< 


3 


IBF692493.1 | 


i 

§ 


W76159.1 


AI343353.1 | 


< 




ii 










< 


Most Similar 
(Top) Hit 


BLAST E 
! Value 


r.0E-08| 


r.OE-OS! 


^0E-08| 


r.0E-08| 


r.oE-osI 


S 


r.OE-osI 


r.0E-08| 


r.0E-08| 


6.0E-08I 


6.0E-08I 




i 


8 


6.0E-08I 


5.0E-08I 


5.0E-08I 


i 


5.0E-08I 


5.0E-08I 


4.0E-08I 


4.0E-08I 


8 




4.0E-08| 


4.0E-08I 


i 


4.0E-08 


4.0E-08I 






















1 














S 












i 














? 












1 








ORF SEQ 


O 


1 




? 






i 


? 






§ 


1 20570 1 


i 




1 




1 








1 253171 




i 




§ 




1 285701 








|ag 


100571 


112441 


13444 


13444 


§ 






13444 


19334 


10729 


i 


12193 




18534 


i 


10063 




15042 






11625 


11625 


17155 




18154 


18313 


18313 




19252 




























































Probe 
SEQ ID 
NO: 




1 1338 


1 3528 


1 3528| 




a 


1 


1 9770 1 


S 


1 


i 
















E; 
S 




1 17241 


1724 






I 8274| 


1 8439| 




i 


i 
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WO 01/57274 PCT/USOl/00666 



Table 4 

bes Expressed in Heart 


ii 

0 o 
ii 
II j 

1 11n 

i s s 1 

^ 1 
§81 

iii 

1 i 1 

ft! 

III 


s ill 1 ? ^ 

G e a g E ™ 2 -i . 

§ is^ i II 

1 i z 12 Iii 1 1 P 
£^jOffii Sf^si 

i i iSs ii 8 i|a 1 1 i| 

; i ISl ill iii ill <fi i ii 
1. ill ilil If II 111 1 1 
i "1 If ii i fti i fi 

|i j's l,S<-||i|iil 1 Ii 
II s'ii ifliiifilfi si! 1 i! 
If s|i |i,ifsgili|l ssiij 1 y 

M tilJji'iiillli,'l-liiiil'J,'.|l 

ill miuiummmmmui 

Ill iiillilliiiiiliiiiiilililli 


ngle Exon Pre 


Top Hit 
Database 

EST HUMAN 
EST_HUMAN 


ii i ii i iii ii l iii i m 


55 


U it 

-° is 


sgi i ii §iis|^|8 spools i 




MostSimrIa 
(Top) Hff 
BLAST E 
Value 

1 3.0E-0i 
3.0E-0J 


III 1 ii iliiiili uml I iff 




e .o) 
it" 


3.5£ 

I. 41 

2.97 

II. 6 
15.76 

7,97 
1.48 
10.93 
10.93 
37.42 
1.93 
1.03 
3.99 

1.42 
2.16 
6.02 
6.02 
1.66 
0.78 

3.35 

2.89 
1.31 
11.62 






iii i i ii 1 ii 




Sag g g 1 


1512C 
1631: 

16416 

18819 
10171' 

10194 
10431 
10581 
10581 
10898 
11226 
11608 
11708 

12248 
12371 
13096 
13096 
13695 
13893 

14203 

14756 
16602 
18799 






6163 
6451 

6558 

9027 
199 

223 
48i^ 
644 
644 
974 

1707 
1811 

2495 
3171 
3171 
3783 

4305 

4876 
6722 
8995 
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Top Hit Descriptor 


POL POLYPROTEIN [CONTAINS: REVERSE TRANSCRIPTASE ; ENDONUCLEASE] 


PM2-HT0130-150999-001-f12 HT0130 Homo sapiens cDNA | 


sapiens cDNA clone TCBAP5232 | 


sapiens cDNA clone TCBAP5232 | 


Homo sapiens chromosome 21 segment HS21C080 | 




1 

5 

1 
< 

1 

! 

I 

1 








MR4-ST0240-240700-013-g04 ST0240 Homo sapiens cDNA ( 


Homo sapiens chromosome 21 segment HS21C079 | 


Homo sapiens chromosome 21 segment HS21C079 | 


RC6-BN1058-270400-031-C06 BN1058 Homo sapiens cDNA | 


contains MSRI.tl MSR1 repetitive eiement ; 


op74d08.s1 Scares NFL T GBC S1 Homo sapiens cDNA clone IMAGE:1582575 3' | 






Human lysosomal membrane glycoprotein-2 (LAMP2) gene, 5' end and flanking region | 




DKFZp434C0514_r1 434 (synonym: htes3) Homo sapiens cDNA clone DKFZp434C051 4 5' 1 


nl17a11.s1 NCIjCGAP HSC1 Homo sapiens cDNA clone IMAGE:1 040924 similar to contains L1 .12 LI 1 
repetitive element; 


PM1-HT0527-160200-001-h05HT0527 Homo sapiens cDNA 1 


xn85h08.x1 Soares NFL T GBC SI Homo sapiens cDNA clone IMAGE:270131 1 3' | 




Homo sapiens testis-specific kinase substrate (TSKS) gene, complete cds | 


RC2-HT0252-1 20200-01 4-h1 0 HT0252 Homo sapiens cDNA | 


Top Hit 
Database 
Source 


SWISSPROT 


EST HUMAN | 


EST HUMAN | 


1 

^1 






1 








EST HUMAN | 






EST HUMAN | 


i '■ 


EST HUMAN 








EST HUMAN | 


EST HUMAN 1 


i 
z> 

^, 


EST HUMAN 1 


EST HUMAN | 






EST_HUMAN | 


TopHitAcession 
No. 


P31792 


BE141959.1 1 






AL163280.2 | 


AJ010770.1 1 


AI01 5304.1 1 


AF044083.1 | 


i 


X51755.1 1 


BF375398.1 | 


AL163279.2 | 


AL163279.2 | 


BE01 2075.1 1 


AM 83500.1 


il 


1 

8 


i 

Q 


i 
§ 


BE254850.1 | 


AL040439.1 1 


1 
1 


BE169421.1 1 


1 




AF200923.2 1 


BE1'i9264.1 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1.0E-08I 


1.0E-08I 


s 


1.0E-08 


1.0E-08I 


1.0E-08I 


1.0E-08I 


9 


1 


s 


s 


s 


9.0E-09' 


8.0E-091 


SS 




7.0E-09 


7.0E-091 


7.0E-09I 




3 


6.0E-09 


6.0E.09! 


6.0E-09I 


6.0E-09 


6.0E-091 




Expression 
Signal 




a 




















i 




















9 










§ 


ORF SEQ 
IDNO: 


212571 




i 


22882I 


1 


254141 


270451 


28773] 








s 


23824 1 










24236 1 


274031 


27945 1 




S 

S3 


24562 










Exon 
SEQID 
NO: 


1 


1 


13080 


13080 


5054 


5358 




1 




§ 




i 


§ 


3361 








1 


7203 


1 


i 






5226 


e 


7746 


1 
























































Probe 
SEQID 
NO: 


1493 


2005| 




.J 


51911 


1 


8 


i 


94341 


i 


i 


41491 




3444 


§ i 


7196 


i 


1 


73351 


7850 


21071 


i 


§ 


5305 


7294 


i 


1394 



257 




258 
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Top Hit Descriptor 


MER1 2 repetitive element; | 


Homo sapiens basic transcription factor 2 p44 (btf2p44) gene, partial cds, neuronal apoptosis inhibitory j 
protein (naip) and survival motor neuron protein (smn) genes, complete cds | 


Homo sapiens nucleolar phosphoprotein B23 (NPM1 ) mRNA, complete cds 1 


Homo sapiens nucleolar phosphoprotein B23 (NPM1) mRNA, complete cds | 


MYOSIN LIGHT CHAIN KINASE, SMOOTH MUSCLE AND NON-MUSCLE ISOZYMES (MLCK) 1 
[CONTAINS: TELOKIN] | 


601058602F1 NIH_MGC 10 Homo sapiens cDNA clone IMAGE:3445177 5' | 


zh35bD3.s1 Soaros_pinoal _gland_N3HPG Homo sapiens cDNA clone IMAGE:414029 3' similar to contains | 
Alu repetitive element;contains element MER22 repetitive element ; | 


Human breakpoint cluster region (BCR) gene, complete cds ( 


CIRCUM3P0R0Z01TE PROTEIN PRECURSOR (CS) | 


Homo sapiens chromosome 21 segment HS21 C083 | 


Homo sapiens GTP binding protein 1 (GTPBP1 ), mRNA | 


MR0-SN0040 050500-002-C07 SN0040 Homo sapiens cDNA | 


|We78h03 j(1 Soares_Dieckgraefe_colon_NHCD Homo sapiens cDNA clone IMAGE:2347253 3' similar to 
SW:RL29_HUMAN P47914 60S RIBOSOMAL PROTEIN L29 ;contains element PTR5 repetitive element ; 


tj46b09.x1 Scares NSF F8 9W OT PA P S1 Homo sapiens cDNA clone IMAGE:2144537 3' similar to 1 
TR:O00372 000372 PUTATIVE P1 50. ; | 


Homo sapiens MCM4 (MCM4) and DNA-PKcs (PRKDC) genes, partial cds | 


QVI -BT0631 -1 50200-071 -fOI BT0631 Homo sapiens cDNA | 


EST89534 Small Intestine 1 Homo sapiens cDNA 5' end | 




Homo sapiens chromosome 21 segment HS21 C080 | 




Homo sapiens TPA inducible protein (LOC51 586), mRNA | 


LYSP100 PROTEIN (LYMPHOID-RESTRICTED HOMOLOG OF SP100) | 


LINE-1 REVERSE TRANSCR!PTASE HOMOLOG | 


LINE-I REVERSE TRANSCRIPTASE HOMOLOG | 


Ksapiens DHFR gene, exon 3 


EST51247 Gall bladder II Homo sapiens cDNA 5' end | 


DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT | 


Top Hit 
Database 


EST HUMAN | 








SWISSPROT 1 


5 


EST HUMAN J 


NT 1 


SWISSPROT 1 


NT 1 


NT 




? 


EST HUMAN | 


NT J 


EST HUMAN | 


EST HUMAN | 


NT__ 




NT 1 


NT ] 


SWISSPROT J 


SWISSPROT ] 


SWISSPROT ] 




EST HUMAN 1 


SWISSPROT ] 


TopHitAcession 
Ho. 


< 


U80017.1 1 


1 

s 




PI 1799 1 


BE535440.1 | 


1 


U07000.1 1 




AL163283.2 | 


11418127 


AW867740.1 1 


< 




i 
§ 


BE080748.1 | 


AA376832.1 | 


U36308.2 1 


AL1 63280.2 | 


77062251 


7706225] 


01 3342 ] 


i 

£ 


P08547 ] 


i 


AA345220.1 ] 


P35084 ] 


Most Similar 
(Top) Hit 
BLAST E 
Value 


I.OE-O9I 








I.OE-O9I 


1.0E-09I 


3 


1.0E.09 




1.0E.09I 


1.0E-09I 


9.OE.I0I 




9.OE-I0I 


8.0E-101 


8.OE-I0I 


8.OE-I0I 




8.0E-10 


7.OE-IOI 


7.0E-101 


7.OE-IO] 


7.0E-10I 


7.0E-10] 






7.OE-IO] 


Expression 
Signal 


O.94I 








^ 




^. 


1-37| 
















o' 








12.45| 


12.45] 












1.39] 


ORFSEQ 
ID NO; 




2258o| 


i 




22674I 


227221 




25549| 


257491 




i 


21048| 




261 72I 




23022] 






291 00 1 


20444] 


20445] 








i 






Exon 
SEQ ID 
NO; 




1 127881 




1 


12876| 










177611 








1 


101151 


132221 


§ 






i 


i 




1 


s 






16373] 


Probe 
SEQID 
NO: 


1 2453 




1 


28951 


1 


.30021 


1 


1 


57361 


79111 


1 


1237| 


2803 














§ 


i 


1 




2512' 


i 


57541 


651 4| 
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o 

1 

I 

g- 


iHomo sapiens ASCL3 gene, CEGP1 gene, C11orf14 gene, C11orf15 gene, C11orf16 gene and 01 1orf17 1 
gene 


tf02d07.x1 NCI_CGAP_Pr28 Homo sapiens cDNA clone IMAGE:2095021 3' 


|RC3-CT0254-031099-012^12 CT0254 Homo sapiens cDNA | 


1 EST38401 2 MAGE resequences, MAGL Homo sapiens cDNA | 


\ RC3-NN0070-1 1 0800-014-h07 NN0070 Homo sapiens cDNA | 


|DKFZp434N219_r1 434 (synonym: htes3) Homo sapiens cDNA clone DKFZp434N219 5' | 




|501822184F1 NIH MGC 75 Homo sapiens cDNA clone IMAGE:4042413 5' | 


IHYPOTHETICAL 67.9 KD PROTEIN ZK688.8 IN CHROMOSOME 111 1 


[HYPOTHETICAL 67.9 KD PROTEIN ZK688.8 IN CHROMOSOME III | 


|qg09f09.x1 Soares_pIacenta_8to9weekE_2NbHP8to9W Homo sapiens cDNA clone 1MAGE:1759049 3' 1 
similar to contains LTR8.b2 LTR8 repetitive element ; 


|hg58g03.x1 NCI CGAP GC6 Homo sapiens cDNA clone IMAGE:2949844 3' similar to contains Alu 1 
jrepetitive element; | 


Homo sapiens ctiromosome 21 segment HS21 CI 03 | 


Homo sapiens mannosidase, beta A, lysosomal (MANBA) gene, and ubiquitin-conjugating enzyme E2D 3 1 
(UBE2D3) genes, complete cds 




|yy32f06.s1 Scares melanocyte 2NbHM Homo sapiens cDNA clone IMAGE:272963 3' similar to contains j 


Homo sapiens extracellular glycoprotein lacritin precursor, gene, complete cds 1 


8 

1 
1 
1 


Homo sapiens chromosome 21 segment HS21 C003 | 


z 

i 
1 

Q 
O 

o 


o 

§ 
< 

Q 

g 

! 

X 

i, 

E 


|AV743302 CB Homo sapiens cDNA clone CBFBGD08 5' | 


|AV743302 CB Homo sapiens cDNA clone CBFBGD08 5' | 


ys74b12.s1 Scares retina N2b4HR Homo sapiens cDNA clone IMAGE:22051 1 3' similar ta contains MER29 1 
repetitive element; 


1L3-CT021 9-1 60200-064-B06 CT021 9 Homo sapiens cDNA 1 


IL3-CT0219-160200-064-B06 CT0219 Homo sapiens cDNA 1 


1 

1 
■1 

i 

s 
1 

1 


Top Hit 
Database 
Source 




lEST HUMAN 1 




|EST HUMAN | 


[EST HUMAN 


|EST HUMAN | 


ISWISSPROT 1 


|EST HUMAN | 


ISWiSSPROT 1 


ISWISSPROT 1 


EST_HUMAN | 


EST HUMAN | 


z 


z 


z 


EST HUMAN | 




z 




1 

5 


|EST HUMAN | 


|EST HUMAN | 


lEST HUMAN | 


EST HUMAN 


lEST HUMAN 1 


[EST HUMAN 1 


|EST_HUMAN | 


Top HitAcession 
No. 


1 


|AI424405.1 1 


|AW853719.1 1 


i 
1 


1 




IQ01033 1 


i 


IP34678 1 




AI221083.1 


AW594709.1 1 


IAL163303.2 | 


AF224669.1 | 


AF003528.1 | 


i 


IAY005150.1 1 


i 
< 


< 




iBE302970.1 | 


1 
> 


1 
< 


jH87208.1 


g 
< 


|AW 850731.1 1 


'T65891.1 1 


Most Similar 
(Top) Hit 
BLAST E 


s 




2 








5 




5 














S.OE-lol 




: 




















.1 _ 

P 
















« 
































" 








ORFSEQ 
ID NO: 


1 






















i 


1 22292^ 










i 




g 


1 258511 


§ 


1 263001 










■ Exon . 
SEQ ID 
NO: 


i 




1 


i 


1 


i 


1 






1 


1 


1 










i 










i 




8 








Probe 
SEQ ID 
NO: 


i 


i 


1 


s 


i 


5 


1 




1 


1 




1 


i 


1 


1 




i 


1 


1 


1 




i 


i 


& 


i 




1 
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Top Hit Descriptor 


a 

a. 
1 

1 
§ 

^ 
i 
1 


DKFZp547D225_r1 547 (synonvm: hfbri ) Homo sapiens cDNA clone DKFZp547D225 5' | 


DKFZp547D225 r1 547 (synonym: hfbri ) Homo sapiens cDNA clone DKFZp547D225 5' | 


1 
1 

Q 

1 
< 

i 

1 
1 

s 

i' 

Q 

S 

i 

o 


DKFZp547D225_r1 547 (synonym: hfbri) Homo sapiens cDNA clone DKFZp547D225 5' | 


ae78f01 .s1 Stratagene schizo brain S1 1 Homo sapiens cDNA clone IMAGE:970297 3' 1 


RC6-BT0S27-1 40200-01 1 -E05 BT0627 Homo sapiens cDNA | 


CI 5635 Clontech human aorta polyA+ mRNA (#6572) Homo sapiens cDNA clone GEN-506B08 5' | 


yn53f1 1 .si Scares adult brain N2b5HB55Y Homo sapiens cDNA clone 1MAGE:1 721 73 3' similar to contains 1 
LI repetitive element ; | 


tm54c0g.x1 NCI CGAP Kidll Homo sapiens cDNA done 1MAGE:21 61 936 3' | 


i 

o 
1 

1 
< 

Q 

1 

1 
t 

1 
1 
O 

j 


EST34392 Embryo, 6 week 1 Homo sapiens cDNA 5' end 


Homo sapiens SNCA isoform (SN CA) gene, complete cds, alternatively spliced 1 


RETROVIRUS-RELATED POL POLYPROTEIN [CONTAINS: REVERSE TRANSCRIPTASE ; 1 
ENDONUCLEASE] 


AV701 656 ADB Homo sapiens cDNA clone ADBABC09 5' ■ | 


Human matrix Gla protein (MGP) gene, complete cds | 


Human matr'ix Gla protein (MGP) gene, complete cds | 


LINE-1 REVERSE TRANSCRIPTASE HOMOLOG | 


AV727859 HTC Homo sapiens cDNA clone HTCASCC6 5' | 


Homo sapiens chromosome 21 segment HS21 CC83 1 




ALDEHYDE OXIDASE | 






Homo sapiens KIAA0851 gene (partial), XT3 gene and LZTFL1 gene | 


i 

1 

< 

1 

s 

1 
1 

1 

i' 
1 


601507531F1 NIH_MGC_71 Homo sapiens cDNA clone IMAGE:3909295 5' 


HUMSUPY059 Human brain oDNA Homo sapiens cDNA clone 069 | 


PRE-MRNA SPLICING FACTOR RNA HELICASE PRP2 | 


(UBE2D3) genes, complete cds 


RC1-HT0256-21C100-013-f08 HT0256 Homo sapiens cDNA | 


Top Hit 
Database 
Source 


EST HUMAN | 


EST HUMAN | 


EST HUMAN | 


EST HUMAN | 


EST HUNMN | 


EST HUMAN | 


EST HUMAN | 


EST_HUMAN | 


a 


1 


1 


EST HUMAN | 




SWISSPROT 


EST HUMAN | 


1- 




SWISSPROT 1 


EST HUMAN | 






SWISSPROT 1 








EST HUMAN | 




EST HUMAN | 


SWISSPROT 1 




EST_HUMAN | 


Top HitAcession 
No. 
















1 


|h19971.1 1 


i 
< 


1 


|AA330642.1 | 


i 

< 


P1 1369 


|AV70 1555.1 1 


|M55270.1 1 


B 


IP08S47 1 


IAV727859.1 | 


i 
I 


IAL163283.2 | 


IP4S034 1 


IAL163213.2 | 


§ 




|AA436042.1 | 


1 

s 




IP20095 1 


i 


IBE149425.1 | 




34395.- 


34395.- 


34395.- 




|AA775985. 


|BE0797a0.- 




^ 


< 


^ 




^ - Ul 
ot-m 










































5,0E-r 






















Expression 
Signal 






























































































































ORF SEQ 
ID NO: 


200421 


218431 


21844| 


230641 




24079 




i 




23580 1 


23641 1 


212021 


27020 












i 


19788| 


1 


% 




264721 


i 




225141 


1 


25927| 






Exon 
SEQ ID 
NO: 


0226 


1947 




3257 


1 


1 




1 


3005 


3792 


1 


1336 


6827 




9145 






8 




i 


1 


4035 


i 


1 




1 




4411 


5803 


1 


7272| 
































































Probe 
SEQ ID 
NO: 




2057 


2057| 




3337 


44oo; 


5421 


i 


30781 


• 3831] 


395/1 






7873 




1 


1 


i 






3320 


4135| 


1 


1 




i 


27601 








7405| 
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if 



IS 



ill 



ill 

! IS i 



IS 

li 
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Top Hit Descriptor 


LINE-1 REVERSE TRANSCRIPTASE HOMOLOG 




Homo sapiens SCL gene locus 1 




Homo sapiens PRO3078 mRNA, complete cds | 




f 
1 
S 

i 




E 
I 

a 

E 
Q 

E 


if73d08.r1 Soares infant brain 1 NIB Homo sapiens cDNA clone IMAGE:28166 5' 1 


QV4-NN1 1 49-250900-423-a03 NN1 1 49 Homo sapiens cDNA ( 


QV4-NN1 1 49-250900-423-a03 NNl 1 49 Homo sapiens cDNA ( 


5021 54a07F1 N IH_MGC_83 Homo sapiens cDNA clone IMAGE:4295977 5' 


HSAAACADH P, Human foetal Brain Whole tissue Homo sapiens cDNA | 


PREGNANCY ZONE PROTEIN PRECURSOR 




Homo sapiens chromosome 21 segment HS21 CI 00 1 


Homo sapiens Xq pseudoautosomal region; segment 2/2 | 


34 KD SPICULE MATRIXPROTEIN PRECURSOR (LSM34) | 


i 
1 

< 

O 

i 
t 

1 
i 

i 
1 

I 

J 


AV730554 HTF Homo sapiens cDNA clone HTFAWF06 5' | 


nz88f11.s1 NCLCGAP_GCB1 Homo sapiens cDNA clone IMAGE:1302573 3' similar to contains Alu 1 
rep^-five element; 


Morone saxalllis myosin heavy chain FM3A (FM3A) mRNA, complete cds ( 


Od10g11.s1 NCI CGAP GCB1 Homo sapiens cDNA clone IMAGE:1367588 similar to contains MER29.t2 ( 
IVlER29repeStive element; | 


EST04462 Fetal brain, Stratagene (cat#935206) Homo sapiens cDNA clone HFBDV33 | 


tz42b05.y1 NCLCGAP_Brn52 Homo sapiens cDNA clone IMAGE:2291 21 7 5' 1 




i 

i 
1 
1 

•1 
1 

o 

8 

z 
1 


Homo sapiens chromosome 21 segment HS21C078 | 


Homo sapiens chromosome 21 segment HS21 C078 


EST386850 MAGE resequences, MAGN Homo sapiens cDNA | 


Homo sapiens Xq pseudoautosomal region; segment 1/2 | 


Top Hit 
Database 
Source 


SWISSPROT 1 












EST HUMAN 1 






EST HUMAN ] 


EST HUMAN | 


1 

^1 


EST HUMAN | 


EST HUMAN | 


SWISSPROT 1 








SWISSPROT 1 


EST HUMAN 1 


EST HUMAN 1 


1 




|esT HUMAN 1 


EST HUMAN | 


EST HUMAN 1 




EST HUMAN | 






EST HUMAN | 




TopHitAcession 
No. 


IP08547 


1 


)AJ131016.1 


lALI 63279.2 


< 


< 


i 


|AL153247,2 


1 48855461 


IR13174.1 J 


|BF365119.1 1 


|BF365119.1 . 


i 

1 


IZ20377.1 1 


|P20742 1 


.|AL163300.2 ] 


IAL163300.2 | 


|AJ271 736.1 1 


1 
§ 


IAA704735.1 1 


|AV730554.1 | 




IAF003249.1 | 


i 
1 


IT06573.1 1 


|BE047779.1 1 


IAJ271 736.1 1 


1 


i 

i 

< 


IAL163278.2 1 


S 
< 


IAJ271735.1 1 


J Ul 

.S ^ H » 

im* 


























































































Expression 
Signal 






















































8 












ORFSEQ 
ID NO: 








209541 






231741 


1 


26840 1 


27060 1 


§ 


1 






226491 














23925 


272631 








23367| 




256761 


256771 




1 


Exon 
SEQ ID 
NO: 


1 


9401 


i 






i 


3370 


5191 


i 






7109 


8474 1 


1 


i 


7491 




I 




1 




14151 


7078 


17262 


g 






5037 


5578 


5578 






































































Probe 
SEQ ID 
NO: 


9627| 


1 


§ 


1199| 


! 




3454| 


i 


6773 






7232 


§ 


9702| 


i 


1 


i 


9270 


4562 


1 


3500 


J 


72011 


„J 


g 


33441 


i 


5171J 


i 






7268| 
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266 
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Top Hit Descriptor 


Homo sapiens chromosome 21 segment HS21C078 | 


CELL SURFACE GLYCOPROTEIN 1 PRECURSOR (OUTER LAYER PROTEIN B) (S-LAYER PROTEIN j 


1 
0 

1 
1 
1 

1 
.1 

i 
1 

E 


Homo sapiens mab-21 (C. elegansHike 1 (MAB21 Ll ) mRNA | 


s 
s 
s 

1 

1 

1 

§ 

S 
0 


FGF-1 =fibroblasi growth factor 1 [human, Itidney, Genomic, 342 nt, segment 2 of 2] | 


1 
Q 
1 

i 


1 

i 
E 

i 
g 

a 
1 

SI 
3 

Si 
3 

i 

1 

1 i 


nw21 g02.s1 N Cl_ CGAP^GCBO Homo sapiens cDNA clone ll*\GE:1241 1 38 3' similar to contains THR.t3 1 
THR repetitive element ; ( 


502038009F1 NCi_CGAP_Bm64 Homo sapiens cDNA clone IMAGE:41 85866 5' | 


7l45el0.x1 Scares NSF F8_SW_0T_PA P_S1 Homo sapiens cDNA clone IMAGE;3524443 3' similar to 1 
contains MER29.b2 MER29 repetitive element ; | 


AV715377 DCB Homo sapiens cDNA clone DCBA1ED3 5' | 


Homo sapiens Xq pseudoautosomal region; segment 1/2 | 


aj24c01 .si Soares_testis_NHT Homo sapiens cDNA clone 1391232 3' similar to contains MERIS.tl MER19 1 
repetitive element ; | 


aj24o01 .si Soares_testis_NHT Homo sapiens cDNA clone 1391232 3' similar to contains MERIS.tl MER19 1 
repetitive element ; 1 


RC4-CT0322-0801 00-01 3-d09 CT0322 Homo sapiens cDN A | 


Homo sapiens TFF gene cluster for trefoil factor, complete cds ( 


xo54h05.x1 NCI CGAP Uti Homo sapiens cDNA clone IMAGE:2707833 3' | 


aj24c01 .si Soares_testis_NHT Homo sapiens cDNA clone 1 391232 3' similar to contains MERIS.tl MER19 1 
repstitivs element i 1 


Human DNA, SINE repetitive element | 




hz71c09.x1 NCl_CGAP_Lu24 Homo sapiens cDNA clone 1MAGE:321 3424 3' | 


yi72e03.r1 Soares placenta Nb2HP Homo sapiens oDNA clone 1MAGE;1447963' | 


H .sapiens DNA for endogenous retroviral like element | 


zq17c10.s1 Stratagene fetal retina 937202 Homo sapiens cDNA clone IWIAGE:629970 3' | 


QV2-BT0258-261 099-01 4-a01 BT0258 Homo sapiens cDNA | 


Top Hit 
Database 
Source 


NT 1 


SWISSPROT 1 


NT 1 


NT 1 


EST_HUMAN | 


NT 1 


NT 1 




EST HUMAN | 




|est human 1 


EST HUMAN | 


NT 1 


EST HUMAN 


1 

X 


EST HUMAN 1 


NT 1 


2 


EST HUMAN 


NT 1 


NT 1 


EST HUMAN | 


EST HUMAN J 


z 


EST HUMAN | 


EST HUMAN | 


TopHitAcession 
No. 


i 

< 


1 




S 

i 


i 

1 
1 


i 


g 
< 


X87344.1 1 


i 


s 


BF108755.1 1 


AV71 5377.1 1 


AJ271 735.1 1 


IAA781 159.1 J 


'aA781 159.1 1 


AW8S1 577.1 1 


1 


AW51329Q.1 1 


|AA781 159.1 


1 
Q 


1 




1 


X89211.1 1 




BE062558.1 | 


Most Similar 
(Top) Hit 
BUVSTE 
Value 


20E-13I 


2.OE-I3I 


20E-13I 


20E-13| 




1.0E-13| 




I.OE-I3I 








1,0E-13| 


1.0E-13I 






9.0E-14I 




9.0E-14I 




9.0E-14I 


9,0E-14| 




8.0E-14I 


8.0E-14| 


8.0E-14I 


8,0E-14| 


Expression 
Signal 




3.87 


6^ 




_ 7.42 


















J 


J 


4041 


4.62| 












2.77 




? 


4.39| 


ORF SEQ 
ID NO: 




25732! 




1 




200721 




' 2IO77I 


21761 1 


1 


1 






20105 






224821 


1 
Si 






243251 






i 


2751 0| 




Exon 
SEQID 
NO: 


139231 




§ 




1 


0252 


i 


11220 


1 






5? 
Eg 


9255 


1 




2328 


2587 


i 




1 


4536 


i 


3783 


i 


7303 


i 




















































Probe 
SEQ ID 
NO; 




5722 




1 


i 




1 


1 1314 


1 


4488 1 


1 




9714 


8 




24511 


2725 


30731 


3200 


37281 


45501 




3872! 


74341 


751 5 1 


87321 
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II 

Ml 
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1 

8 










1 


























intainsMERig.tl 


1 




1 
E 
1 


1 

M 






s 










1 














milar 












■R 








Top Hit Descriptor 


)Ci77d02.x1 Scares NFL.T_GBC_S1 Homo sapiens cDNA clone IMAGE:27004B3 3' simila 
THR.t2 THR repetitive element ; 


[Homo sapiens Xq pseudoautosomal region; segment 2/2 


1QV1-LT0036-150200-070-C10 LT0036 Homo sapiens cDNA 


|nab81c12.x1 Soares_NSF_Fe__9W_OT_PA_P_S1 Homo sapiens cDNA clone IMAGE: 3' 


[Homo sapiens chromosome 21 segment HS21 COOS 


(HLA+t) gene, RoRet gene, and sodium phosphate transporter (NPT3) gene, complete cds 


|UI-h-BW0-ajb-g-10-0-Ul.s1 NCI_CGAP_Sub6 Homo sapiens cDNA clone IMAGE.273121S 


|AV730056 HTF Homo sapiens cDNA clone HTFAVE06 5' 


Homo sapiens chromosome 21 segment HS21 CI 03 


|DKFZp751C0810_r1 761 (synonym: hamy2) Homo sapiens cDNA clone DKFZp761C0810 ; 


Homo sapiens mRNA for transcription factor 


1 Homo sapiens mRNA for transcription factor 


1 LY1 1 42F Human fetal heart, Lambda ZAP Express Homo sapiens cDNA clone LY1 142 5' si; 
ANF(CARDIODIUTIN) 


1 NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5 


|7P01 F03 Chromosome 7 Placental cDNA Library Homo sapiens cDNA clone 7P01 F03 


1 7P01 FC3 Chromosome 7 Placental cDNA Library Homo sapiens cDNA clone 7P01 F03 


1 Mus musculus ultra high sulfur keratin gene, complete.cds 


|Mus musculus ultra high sulfur keratin gene, complete cds 


1 oc36a07, si NCI_CGAP_GCB1 Homo sapiens cDNA clone 1MAGE:1 351 764 3' similar to cc 
MER1 9 repetitive element ; 


iHomo sapiens DNA, DLEC1 to 0RCTL4 gene region, section 1/2(DLEC1, ORCTL3, ORC- 
1 complete cds) 


iHomo sapiens calcium channel aiphalE subunit (CACNA1 E) gene, exons 7-49, and partial < 
[spliced 


1 Homo sapiens calcium channel aiphal E subunit (CACNA1 E) gene, exons 7-49, and partial ( 
Ispliced 


iHomo sapiens calcium channel aiphalE subunit (CACNA1 E) gene, exons 7-49, and partial < 
Ispliced 




Top Hit 
Database 
Source 


EST_HUMAN J 




lEST HUMAN | 


[est human 1 


1- 
z 




[est human 1 


[est human I 




|EST HUMAN | 






EST HUMAN J 


1 


lEST HUMAN 1 


|EST HUMAN J 






IeST HUMAN 1 


z 


z 




z 


z 


TopHitAcession 
No. 


i 


|AJ271 736.1 1 


i 
i 




lAL1 63208.2 | 


U91 328.1 


g 


IAV730055.-I 1 


i 

8 


|AL1 18596.1 1 


|AJ1 30894.1 1 




N89452.1 




AA078097.1 


IAA078097.1 1 


|M27685.1 1 


IM27685.1 1 


i 


i 

s 


|aF223391.1 | 


AF223391.1 


|aF223391.1 


i 


Most Similar 
(Top) Hit 
Bl^STE 
Value 








P 


Q 


i 




§ 




P 




4.0E-15 
























1 2.0E-15! 




s 






























§ 


















Expressic 
Signal 
















































ORF SEQ 
ID NO: 




g 














3779 


g 


§ 


i 






i 


8 








i 






















































Exon 
SEQ ID 
NO: 


■ 17813 


i 


1 197701 




10350 




s 


§ 




1 


i 


1 


14023 












17559 






10320 


1 10320 




Probe 
SEQ ID 
NO: 


1 


1 


i 


1 


9 


2733 


3423 


g 






i 


8414 


4123 


4843 




4955 


s 




7709 




g 


i 


S 
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Top Hit Descriptor 




Homo sapiens calcium channel alphalE subunit (CACNA1 E) gene, exons 7-49, and partial cds, alternatively 1 


Iwf07f06.x1 Soares NFL T GEO SI Homo sapiens cDNA clone IMAGE:2349923 3' similar to TR:Q61043 1 
IQ61043N1NEIN.; | 


i REPETITIVE PROLINE-RICH CELL WALL PROTEIN 2 PRECURSOR | 


.REPETITIVE PROLINE-RICH CELL WALL PROTEIN 2 PRECURSOR | 


Homo sapiens ASCL3 gene, CEGP1 gene, C1 1orf14 gene, C11orf15 gene, Cl 1orf16 gene and C11orf17 1 
gene | 


i 
i 
§ 

a 
1 

X 

i 

i 

1 

s 
■^1 

1 

i 


tza78d10.r1 Soares Jsial lungLNbHL19W Homo sapiens cDNA ctone IMAGE:298675 5' similar to 1 
WP:F44F4.8 CE02227 TRANSPOSASE ; 


Human DNA. SINE repetitiue element | 


CM0-HT0244-201099-O7B-a12 HT0244 Homo sapiens cDNA | 


CftylO-HT0244-201099-078-a12 HT0244 Homo sapiens cDNA ( 




iHomo sapiens calcium channel alphalE subunit (CACNA1E) gene, exons 7-49, and partial ods, alternatively 1 
spliced 1 


Ispliced ' ' ' 1 


;tx26h05.x1 NCI CGAP Lu24 Homo sapiens cDNA clone IMAGE:2270745 3' similar to TR:Q13539 Q13539 1 
' MARINER TRANSPOSASE. ; | 


g 

1 

1 

i 

i 
1 

1 
i, 
1 
^ 
1 


LINE-1 REVERSE TRANSCRIPTASE HOMOLOG | 


RC3-HT0549-1 005D0-022-b05 HT0649 Homo sapiens oDNA | 


§ 
1 

1 
1 

< 
0 

i 
1 


|ye40e10.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:120234 3' similar to contains | 
|MER6repefitive element; | 


QV3-BT05S9-270100-074-g05 BT0569 Homo sapiens oDNA | 




qf68h06.x1 SoaresJestis_NHT Homo sapiens cDNA clone IMAGE:1755227 3' | 


i 

1 

1 

I 
1 

1 
» 

1 

1 


Homo sapiens spermidine synthase (SRM) mRNA | 


1 Top Hit 1 


Database 
Source 




T 


i 


ISWiSSPROT 1 


ISWlSSPROT 1 


r 


|EST HUlulAN 1 


EST_HUMAN 


iNT 1 


1 
I 


|est_human 1 


iNT 1 






EST HUMAN | 


|EST HUMAN | 


[SWISSPROT 1 




lEST HUMAN | 


EST HUMAN j 


1EST_HUMAN J 


iNT 1 


|EST_HUMAN | 


[EST HUMAN | 


[NT 1 




i 


1 
< 


AF223391.1 


i 

2 


IP13993 1 


IP13993 1 




s 


S 
§ 


|D1 4547.1 1 


IAW379465.1 1 


|AW3r9465.1 J 


IAJ271 735.1 1 


i 

< 


!aF2231391.1 | 


i 


IBE043584.1 | 


IP08547 1 


i 


2 


!t95763.1 1 


IBEO 7421 7.1 1 


IAL163280.2 | 




1 


1 45072081 


Most Similar 
(Top) Hit 
BLAST E 
Value 


2.0E-15 


2.0E-15 




1 2.0E-15I 


1 2.0E-151 


1 2.OE.I5I 




i 




































2.0E. 


1 2.0E- 


1 2.0E- 




















































S 


1 






1 2.07I 




1 1-33| 
























d 


















ORFSEQ 
ID NO: 


23186 






1 247411 


1 24742 1 




1 263311 


26397! 




1 274841 


1 274851 




23185! 






227061 




i 


t 247311 


1 






i 


1 


i 




SEQ ID 
NO: 


,J 


1 13381 1 


1 14415] 


1 






1 161731 


16237 


§ 


1 172771 




1 


13381 


i 






1 








§ 

3 


1 








1 Probe 


SEQID 
NO; 


3465 


i 




( 5097| 


g 




i 


1 6375I 




7410 


1 74101 


8212 


1 


i 






1 3103] 


4262 


i 


i 




6791 


1 


1 
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Top Hit Descriptor 


NEUROFILAMENT TRIPLET H PROTEIN (200 KDA NEUROFILAMENT PROTEIN) (NEUROFILAMENT 1 
HEAVY POLYPEPTIDE) (NF-H) | 


Mus musculus ultra high sulfur Iceratin gene, complete cds | 


Mus musculus ultra high sulfur keratin gene, complete cds | 


Homo sapiens MHC class 1 region | 




Homo sapiens chromosome 21 segment HS21 C047 | 


Homo sapiens chromosome 21 segment HS21 C047 | 


transcriptional regulatory elements) 


MULTIDRUG RESISTANCE PROTEIN 1 (P-GLYCOPROTEIN 1) 


Homo sapiens chromosome 21 segment HS21 C007 1 


COLLAGEN ALPHA 1(111) CHAIN PRECURSOR | 




(UBE2D3) genes, complete cds | 


yf30e07 r1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:128388 5' 1 


he38e05.x1 NCI_CGAP_CML1 Homo sapiens cDNA clone IMAGE:292131 2 3' similar to contains Alu 1 
repetitive elemenfccontains LTR8.t1 LTR8 repetitive element ; | 


i 
< 

1 
1 

1 

i 

z 

1 

1 

i 


i 
§ 

o 
O 

i 

a 

1 

1 
i 

1 
1 
1 


URIDINE PHOSPHORYLASE (UDRPASE) 


MYOSIN LIGHT CHAIN KINASE, SMOOTH MUSCLE (MLCK) [CONTAINS: TELOKIN] | 


zpl 8g12.s1 Stratagene fetal retina 937202 Homo sapiens cDNA clone IMAGE:609862 3' 1 


1 
§ 

o 

i 

1 

i 

o 

1 

1 

S 




xxl 0b04.x1 NCI CGAP_Pan1 Homo sapiens cDNA clone IMAGE:2837071 3' similar to gb:L2086B 6QS 1 
RIBOSOMAL PROTEIN L4 (HUMAN); | 


xx1 0b04,x1 NCI_CGAP_Pan1 Homo sapiens cDNA clone IMAGE:2837071 3' similar fo gb:L20868 SOS 1 
RIBOSOMAL PROTEIN L4 (HUMAN); 


ya49c07.s1 Soares infant brain 1 NIB Homo sapiens cDNA clone IMAGE:53285 3' similar to contains LI 1 
repetitive element ; | 


f 1 1 


>PROT 1 


























UMAN 1 


UMAN 1 


UMAN 1 


UMAN 1 


PROT 1 


O 


UMAN 1 


UMAN 1 




UMAN j 


UMAN 


UMAN 1 




-SIMS 


z 














SVV1S5 












EST h 




EST h 


i 


SWISS 




EST h 




EST h 


EST H 


EST_H 


TopHitAcession 
No. 


P12036 1 


M27685.1 1 


1 


AF055066.-I | 


Q95156 1 


AL163247.2 | 


AL163247.2 | 


D13391.1 1 


P08183 1 


AL163207.2 | 


P024S1 1 


U79410.1 1 


< 


RC9942.1 1 


1 
1 
1 


AH 95642.1 | 


AM 85642.1 1 


Q16831 1 


i 

o 


i 
i 


AI472167.1 1 


47589771 


AW31 6976.1 1 


AW316976.1 


R16220.1 1 










































































































lip 




















































•§ m 


s. 








S ! 






















? 


? 








^ 


R 




2 




















































ORFSEQ 
ID NO: 


22621 








26779 




277731 


27981 1 


1 


214991 


218531 


220671 








I 






i 








20121 1 


20122 


24774 1 




1 


5216 




i 






7549 


7737 


i 


1 




2170 


3435 


i 


5792 


5911 


S 


i 






7328 


i 


0305 


0305 


8 
S 






















































Probe 
SEQID 
NO: 


1 


1 


i 


5790 1 


6711 






rJ 




1729 








i 






§ 


1 


8719 




1 


3718 




S 
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§5 

X a: a 
^ Q- C 

141. 



.g 3: 1- o 



; 5 S 



'lis 



I I 
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§ 






5 
1 


































to contains MER29 






s 






S 

o 

i 

ia 
S 




s 

5 
1 
5 


Top Hit Descriptor 


1 Homo sapiens DNA for amyloid precursor protein, complete cds 


|oo86e08.s1 NCLCGAP_Kid5 Homo sapiens cDNA clone IMAGE:1 573094 3' similar t 
PMS3 MRNA ;contains OFR.tl OFR repetitive element : 


^attus norvegicus mRNA for rTIM, complete ods 


[Homo sapiens chromosome 21 segment HS21C001 


|Homo sapiens LGMD2B gene 


I5D1844465F1 NiH_MGC_54 Homo sapiens cDNA clone IMAGE:4064945 5' 


|RC1-OT0083-100B00-019-g08 OT0083 Homo sapiens cDNA 


|CM1-NN0063-280400-203-h08 NN0063 Homo sapiens cDNA 


[Homo sapiens chromosome 21 segment HS21C013 


|QV3-HT0458-1 702DO-090-a12 HT045B Homo sapiens cDNA 


|Homo sapiens mRNA for KIAA0397 protein, partial cds 


|Homo sapiens mRNA for KI/\A0397 protein, partial cds 


|RC4-BT0311-141199-011-h06BT0311 Homo sapiens cDNA 


IZONADHESIN PRECURSOR 


ZONADHESIN PRECURSOR 


,ts30fQ3.x1 NCI_GGAP_Pan1 Homo sapiens cDNA clone IMAGE:2230109 3' similar tc 
1 HYPOTHETICAL 51.1 KD PROTEIN ; 


'QVO-HT0103-091 199-OSO-gl 1 HT0103 Homo sapiens cDNA 


AU 1 36779 PLACE1 Homo sapiens cDNA clone PLACE1 005052 5' 


ht09g01 .x1 NCI_CGAP_Kid13 Homo sapiens cDNA clone IMAGE:3146255 3' similar 1 
MER29 repetitive element; 


i 

1 

i 

j 
s 

o 
s 
1 

i 


601380636F1 NIH_MGC_83 Homo sapiens cDNA clone IMAGE:3951008 5"^ 


nl46c04.s1 NCI_CGAP_Pr4 Homo sapiens cDNA clone IMAGErl 04371 8 similar to coi 
MER2g repetitive element; 


ar88d12,x1 Barstead colon HPLRB7 Homo sapiens cDNA clone IMAGE:2152343 3' 


DKFZp434i0830_r1 434 (synonym: htes3) Homo sapiens cDNA clone DKFZp434l083l 


qg47e05.x1 Soares_testis_NHT Homo^sapiens cDNA clone IMAGE:1838333 3' similai 
PROTEIN (HUMAN); 


Homo sapiens SET domain and mariner transposase fusion gene (SETMAR) mRNA 


tz94a03.x1 NCI_CGAP_Kid11 Homo sapiens cDNA clone IMAGE:2295204 3' similart 
NEUTRAL PROTEASE LARGE SUBUNIT ; 


III 




HUMAN 








HUMAN 


HUMAN 


HUMAN 




HUMAN 






HUMAN 1 


SPROT 1 


O 

a: 
1 


HUMAN 1 


HUMAN 1 


HUMAN] 


HUMAN 1 


HUMAN 1 


HUMAN 1 


HUMAN 


I 




HUMAN 1 




HUMAN 1 


»- s « 










z 














Z 




22 
1 




~l 


'EST 1 








m 1- 






w' 








1 
























































Top Hit Ac 
No. 


ID87S75.1 


|aA970713. 


IAB01S576. 


IAL163201.: 


3 




IBF361093.- 




|AL1 63213.2 




|AB007857.: 


|AB007857.: 


i 


a 


o 


|aI624582.1 


IBE141786.- 


|AU13S779.- 


|bE350127.1 


IBE973829.1 


^ I 
S g 


AA557657.1 


§ 
< 


|AL079752.1 


|aI223104.1 


1 


AI702438.1 


Most Similar 
(Top) Hit 
BlJiSTE 
Value 


5.0E-21 












1 3.0E-21 


1 3.0E-21 


1 3.0E-21 


1 2.0E-21 


1 2.0E-21 


2.0E-21 




















1.0E-21 








1.0E-21| 




Expression 
Signal 


1 






































s. 








? 








a 




1 21473 






g 
SI 




§ 


1 


14992 






i 




1 








1 








i 










i 


o: Q 
o 


















































Q 


149// 


§ 








8 
















1 


2468 


... 






8287 












6134 




1 


























































Probe 
SEQID 
NO: 


§ 


1 




2228 


i 


5749 






i 


1 






1 






SS 




/122 


8412 


1 8b3b 




1235 






i 


8021 
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Top Hit Descriptor 
H uman 90 kD heat shock protein gene, complete cds 

Homo sapiens PTS gene for 6-pyruvoyltetrahydropterin synthase, complete cds 


QV1-BT0821-120900-360-b03 BT0821 Homo sapiens cDNA 

Human gene for Ah-receptor, exon 7-9 

xv17f03 XI Soar6s_NFL_T_GBC_S1 Homo sapiens cDNA clone IMAGE;281340S 3' similar to contains Alu 
repetitive element;contains MER19.t2 MERl 9 repetitive element ; 


2t62b01 r1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:7268e9 5' similar to TR-G1335769 

G13357S9 GAG-POL POLYPROTEIN. ; 

Human HsLIM15 mRNA for HsUmIS, complete cds 


O 
1 

o 

ci 

a 
s 
1 

% 

It! 

ill 

u- I 1 

II! 
Ills 

0 <o g W 

ill 

g. 9? ^ 
-s g^i^ 
i ",o 

5. -a X uj 


wr35d1 0.x1 NCI_CGAP_Ut1 Homo sapiens cDNA clone IMAGE;2492563 3' similar to TR:01 5546 01 5546 
HERV-E ENVELOPE GLYCOPROTEIN ; 


Wf27g07.x1 SDares_NFL_T_GBC_S1 Homo sapiens cDNA clone IMAGE;2356860 3' similar to contains 
element MER6 repetitive element ; 


BC_S1 Homo sapiens cDNA clone IMAGE:2356860 3' similar to contains 
3nt; 

segment HS21C048 

segment HS21C048 

segment HS21C048 


OT0032 Homo sapiens cDNA 

segment HS21C027 

tHN0003 Homo sapiens CDNA 

1 Homo sapiens cDNA clone l!^GE:1288332 3' similar to contains MER4.b1 

brain'cDNA Homo sapiens cDNA clone c-23f05 
piens CDNA 5' end similar to EST containing 0 family repeat 
sapiens cDNA 3' 


wf27gD7.x1 Soares_NFL_T_G 
element MER6 repetitive elem< 

Homo sapiens chromosome 2- 


iliil-llii 

^itiiltii:' 


Top Hit 
Database 
Source 

NT 
NT 


EST HUMAN ' 
NT 

EST HUMAN 




NT 

EST HUMAN 


EST HUMAN 


EST HUMAN 


EST HUMAN 

NT 

NT 

NT 


EST HUMAN 
NT 

EST HUMAN 
ESTJHUMAN 


Top Hit Acession 


No. 

J04988.1 
AB042297.1 


15 ii 


AA403053.1 
D63882.1 


'1 i 

i i 
< < 


< ^ 


AI806418.1 


AI806418.1 
AL1 63248.2 
AL163248.2 
AL1 63248.2 


AW983880.1 

AA761215.1 

AA383873.1 
M557072,1 


Most Similar 
(Top) Hit 


BLAST E 
Value 

4.0E-29 
3.0E-29 


3.0E-29 
3.0E-29 

3.0E.29 


3.0E-29, 
3.0E-29 


2.0E-29 
2.0E-29 




2.0E-29 


2.0E-29 
2.0E-29 
2.0E-29 




Expression 
Signal 

4.71 
1.45 


1.31 
2.07 

1.6 










5 S S ? 


1.96 
1.73 
5.41 

3.04 
1-5 
8.81 
3.37 
3.53 


ORFSEQ 


27149 




1 1 


20241 
21278 


21279 
2385S 


25765 


25765 
27539 
27540 
27973 


27174 
25995 

27105 


Exon 
SEO ID 


NO; 

16956 
14214 


g K ?2 


ill 








15658 
17334 
17334 
17729 


18618 
18807 
16982 

15871 
18889 
15721 
16691 
16916 


Probe 
SEO ID 


NO: 

7079 
4317 


4626 
7070 

7362, 
74921 


8591 
9248 

484 


484 
1616 




s 

D3 


6461 
7474 
7474 
7879 
7879 


8804 
9004 
7105 

5966 
9132 
5815 
6812 
7039 
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Top Hit Descriptor 


ha33d06.x1 NCI_CGAP_Kid1 2 Homo sapiens cDNA clone IMAGE:2875499 3' similar to contains THR.bS 1 


C1 8939 Human placenta cDNA (TFuJIwara) Homo sapiens cDNA done GEN-570C01 5' • | 


hd3Qb04.x1 SoarBS_NFL_T_GBC_S1 Homo sapiens cDNA clone IMAGE:2910991 3' similar to contains 1 
MERl.tS MER1 MER1 repetitive element ; 


Homo sapiens chromosome 21 segment H.S21C003 | 


ac77b08.s1 Stratagene lung (#937210) Homo sapiens cDNA clone IMAGE:35S599 3' | 


602022560F1 NCI CGAP Brn67 Homo sapiens cDNA clone !MAGE:41 57991 5' 


Homo sapiens methionine aminopeptidase; elF-2-associaled p6/ (MNPEP), mRNA | 


EST1 86868 HCC cell line (matastasis to liver in mouse) 11 Homo sapiens cDNA 5' end 1 


601809932F1 NIH MGC 18 Homo sapiens cDNA clone 1MAGE:4040694 5' | 


CHR220532 Chromosome 22 exon Homo sapiens cDNA clone C22_^728 5' 1 


HSC05F032 normalized infant brain cDNA Homo sapiens cDNA clone c-05f03 3' 1 






EST84555 Colon adenocarcinoma IV Homo sapiens cDNA 5' end 1 


hw05a1 1 .x1 NCI CGAP Lu24 Homo sapiens cDNA clone IMAGE;31 82012 3' | 


hw05a1 1 .x1 NCI_CGAP_Lu24 Homo sapiens cDNA clone IMAGE:3182012 3' 1 


Human lambda-immunoglobulin constant region complex (germline) | 


Homo sapiens caioium channel alphalE subunif (CACNA1 E) gene, exons 7-49, and partial cds, alternaiively 1 
spliced 1 


Homo sapiens MHC class 1 region | 


AU1 19105 HEMBA1 Homo sapiens cDNA clone HEMBA1005050 5' | 


RC5-BT0377-091 299-031 -D1 2 BT0377 Homo sapiens cDNA 1 


501433087F1 NIH_MGC_72 Homo sapiens cDNA clone 1MAGE:3918524 5' | 


Homo sapiens type 1 DNA topoisomerase gene, aeon 8 | 


Homo sapiens type 1 DNA topoisomerase gene, exon 8 | 


7k05f04.x1 NCLCGAP_GC6 Homo sapiens cDNA clone IMAGE;3443479 3' similar to TR;Q13537 Q13537 1 
SIMILAR TO POGO ELEMENT. ;contains Ll.tl LI rep^itive element ; | 


Homo sapiens Xq pseudoautosomai region; segment 1/2 | 


POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE (PROTEIN-UDP 
ACETYLGAUCTOSAMINYLTRANSFERASE) (UDP-GALNAC:POLYPEPTIDE, N- 
ACETYLGALACTOSAMINYLTRANSFERASE) (GALNAC-T1) 


Homo sapiens chromosome 21 segment HS21 C080 | 


Top Hit 
Database 
Source 


EST HUMAN 


EST HUMAN 


EST HUMAN 


z 


EST' HUMAN 1 


EST HUMAN 1 


NT 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN 


EST_HUMAN 1 


NT 1 


NT 1 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN | 


NT 1 




NT 


EST HUMAN | 


EST HUMAN 1 


EST HUMAN | 


NT 1 


NT 1 


EST HUMAN | 


NT 1 


SWISSPROT 


NT 1 


Top HitAcession 
No. 


i 




|aW468897.1 


AL1 63203.2 


AA564377.1 | 


BF34772B.1 1 


i 




BF1 83230.1 1 


i 

I 


1 


§ 


1 
I 


1 


1 


1 


X51 755.1 1 


|aF223391.1 | 


AFQ55066.1 | 


AU119105.1 1 




i 

i 


i 

s 


i 

1 




< 


Q10473 


AL163280.2 | 


Most Similar 
(Top) Hit 
BLAST E 
Value 


2,0E.30 




1 .OE-30 


i 


.OE-30 


.OE-30 




^ 














7.0E-31 




7.0E-31 


6.0E-31 


6.0E-31 


6.0E-31 




6,0E-31 


5.0E-31 


5.0E-31 


5.CE-31 




4.0E-31 


4.0E-31 


















§ 

r 










3.24 1 








2.321 


i5 


2.04 1 












2.56| 


s 










2.851 




1.21 1 




S 


2.59 1 


ORF SEQ 
ID NO: 




s 


1 20281 


1 


219531 


221871 


i 




s 






20820 1 






223921 


1 


25272| 






282551 


1 




1 












Exon 
SEQID 
NO: 


1 


i 


i 








8 




6403 


i 


s 




i 




2500 


§ 


1 


i 


i 


i 




9541 


0157 








1 


1674| 


























































Probe 
SEQID 
NO: 


J 








2165 


2413 


2973 




6545 


9733 


70131 


§ 


2362 


i 


2633 






1 3625 1 


6739 1 


81201 




i 






i 




1 


1775| 
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( 


1 

i 

i- 




Homo sapiens K1AA0569 gene product (KIAA0569), mRNA | 


Homo sapiens gene for activin receptor type MB, complete cds | 


Homo sapiens SEC63, endoplasmic reticulum translocon component (S. cerevisiae) like (SEC63L), mRNA | 






Homo sapiens ciiromosome 21 segment HS21 C006 | 


Horse mRNA for ferritin L-chain, complete cds | 


403 RIBOSOMAL PROTEIN SI 5 (RIG PROTEIN) | 


601458531 F1 NIH_MGC_66 Homo sapiens cDNA clone IMAGE:3862086 5' 1 


< 

q 

t 
1 

5 

i 
> 

a 


s 
1 

a 
1 


DKFZp761G1513_r1 761 (synonym: hamy2) Homo sapiens cDNA clone DKFZp761G1 51 3 5' | 


aa88f1 1.S1 Stratagene fetal retina 937202 Homo sapiens cDNA clone IMAGE:838413 3' similar to contains 1 
THR.t2 THR repetitive element; | 


ht09g01 xl NCI_CGAP_Kid13 Homo sapiens cDNA clone IMAGE:314B256 3' similar to contains MER29.b3 1 
MER29 repetitive element ; 


nr06f04.s1 NCI CGAP ColO Homo sapiens cDNA clone 1IV1AGE:1161055 3' similar to TR:Q13537Q13537 1 
MER37 TRANSPOSABLE ELEMENT, COMPLETE CONSENSUS SEQUENCE. ; | 


Homo sapiens B9 protein (B9), mRNA | 


AV71 0948 Cu Homo sapiens cDNA clone CuAALB07 5' 1 


AV71 0948 Cu Homo sapiens cDNA clone CuAALB07 5' | 


601304125F1 NIH_MGC_21 Homo sapiens cDNA clone iMAGE:3638310 5' | 


601304125F1 NIH MGC 21 Homo sapiens cDNA clone 1MAGE:3638310 5' | 




< 

i 
§ 

i 
1 

z 

.1 
1 

1 


Homo sapiens MAGE-B2 (MAGE-B2), MAGE-B3 (MAGE-B3), MAGE-B4 (MAGE-B4), and MAGE-B1 1 
(MAGE-B1 ) genes, complete cds | 


OLFACTORY RECEPTOR 2C1 | 


OLFACTORY RECEPTOR 2C1 


OLFACTORY RECEPTOR 2C1 


DKFZp647B235 r1 547 (synonym: hfbr1) Homo sapiens cDNA clone DKFZp547B235 5' | 


Top Hit 


II 


















SWISSPROT 1 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN | 




EST HUMAN | 


z 


EST HUMAN 1 


EST_HUMAN | 


EST HUMAN | 


EST HUMAN | 




i 

T 




SWISSPROT 1 


SWISSPROT 1 


SWISSPROT 1 


EST HUMAN | 


TopHitAcession 


d 


57300381 


11430273] 


1 
1 


6005871 


1 


114203291 


AL1 63206.2 | 


D14523.1 1 


P11174 1 


BF035327.1 1 


AW838171.1 1 


AI393388.1 1 


ALII 9245.1 1 


AA458824.1 | 


BE350127.1 1 


AA877764.1 | 


76615351 


1 
1 


AV71 0948.1 1 




1 


AF1 4851 2.1 1 


AH 14527.1 1 


i 

1 


095371 1 


S 

8 


095371 1 


AL1 34376.1 | 


Most Similar 
(Top) Hit 


BLAST E 
Value 


S 


4.0E-31I 


2 


2 


3.0E-31 


3.0E-31 1 


3.0E-31I 


2 


3.0E-31| 


3.0E-31 1 


2.0E-31I 


2.0E-31I 


2.0E-3l| 


2.0E-3l| 


2.0E-31 


2.0E-31 1 


2.0E-31I 


2.0E-31 1 


2 


2.0E-31I 


2.0E-31I 


2.0E-31| 


2.0E^1| 


l.OE-31 


.0E-31| 


2 


.OE-31 1 


.0E-31| 












Expression 
Signal 


















5 


















En 






















ORFSEQ 


d 










1 


i 




27585| 






1 






1 


1 




















21 403 1 




1 


242091 


Exon 
SEQ ID 
NO: 


12621 j 


1 




12423 




i 


i 


17376| 


17920] 




117711 


120541 




1 


s 




171851 


i 


175501 


17647| 


176471 


1 




g 
8 


11544| 


115441 


1 


1 


Probe 
SEQ ID 

NO: 




1 


1 


1 


i 


i 


1 


1 


1 




1875 


21671 


1 


1 2391 1 




1 7246| 


g 

f2 


1 


77001 


7797 


7797 


1 


9431 




1 


1 


1640 


4535| 
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Top Hit Descriptor 


1 DKFZp547B235 r1 547 (synonym: hfbri ) Homo sapiens cDNA clone DKFZp547B235 5' 


1 
g 
■5. 

% 

i 

l' 
<? 
§ 

i 
1 




qf21h03.x1 NCI CGAP Brn25 Homo sapiens cDNA clone iMAGE:1750709 3' similar to TR:Q16595 1 
Q1 6595 FRATAXIN.; | 


Human farriiliai Alzheimer's disease (STM2) gene, complete cds | 


1 
1 

i 


i 

i 
1 

z 
a 

1 

t 

1 

i 

1 

1 
1 

1 
1 


NUCLEAR ENVELOPE PORE MEMBRANE PROTEIN POM 121 (PORE MEMBRANE PROTEIN OF 121 

KD)(P145) 




tm34a10.x1 NCI_CGAP_Kid1 1 Homo sapiens cDNA clone IMAGE:21 59994 3' similar to contains MER29.t3 
MER29 repetitive element ; 


601 51 1 530F1 NiH^MGC^71 Homo sapiens cDNA clone IMAGE:3913087 5' 


jHomc sapiens PR01181 mRNA, complete cds | 


|Homo sapiens chromosome 21 segment HS21C046 | 




|Homo sapiens AT-binding transcription factor 1 (ATBF1 ), mRNA | 


iHomo sapiens FLI-1 gene, partial | 


AV731 500 HTF Homo sapiens cDNA clone HTFAKC07 5' 


|AV758S34 BM Homo sapiens cDNA clone BMFBBH12 5' | 


AV758634 BM Homo sapiens cDNA clone BMFBBH12 5' 


|zi95a07.s1 Scares fetal liver spleen 1 NFLS SI Homo sapiens cDNA clone IMAGE:448500 3' similar to I 
contains THR.tS THR repetitive element ; 


601156285F1 NIH_MGC_21 Homo sapiens cDNA clone IMAGE:3139701 5' 


|(MLLT4) mRNA | 


(MLLT4) mRNA 


601 1 73631 F1 NIH_MGC_17 Homo sapiens cDNA clone IMAGE:3529159 5' | 


|H,sapiens mRNA for myosin | 


IH.sapiens mRNA for myosin | 


Top Hit 


|l 


2 


EST HUMAN | 




EST HUMAN | 




EST_HUMAN | 


EST HUMAN | 


SWISSPROT 1 


z 


EST HUMAN | 


EST HUMAN 1 


z 






z 


z 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN 1 


EST_HUMAN 


EST_HUMAN 1 




Z 1 


EST HUMAN | 


z 


z 


TopHitAcession 


i 


1AL134376.1 | 


IAW391S79.1 1 


|AF048727.1 | 


AI086434.1 1 


|U50871.1 1 


|AV723976.1 | 


IAI056770.1 1 




XI 7283.1 1 


AI478104.1 




|AF116627.1 1 


< 


1 114325741 


1 114325741 


in 7293.1 1 


|AV731S00.1 1 


IAV758634.1 | 


1AV758634.1 1 


i 

1 


IBE279086.1 1 


1 




BE296613.1 ' | 


IZ38133.1 1 


IZ38133.1 1 


IVlost Similar 
(Too) Hit 


BLAST E 
Value 


1 1.0E-31| 




2 




1 9.OE-32I 


2 


S 


7.OE-32I 


7.OE-32I 


6.0E-32 




1 5.0E-32| 


i 4.0E-32I 


1 4.OE-32I 


1 4.0E-32| 


1 3.OE-32I 


1 3.OE-32I 




1 3.OE-32I 


3.0E-32 




3.OE-32I 




1 2.OE-32I 


2.OE-32I 


<? 


Expression 


Signal 
























1 29.57| 










s 
















1 19.01 1 




ORFSEQ 


ID NO: 


i 


1 249271 




1 






























274781 


i 




24882 




1 24454 


259291 


25930 1 


Exon 
SEQ ID 
NO: 




1 15159| 


1 15635| 


1 I8I67I 




159051 




8 




§ 




i 


i 


i 


163421 


10392| 










i 


s 


§ 


% 1 






Probe 
SEQ ID 
NO: 


4535 


5235 


i 




1 28351 




2032 


1 4754I 


i 


1 2702! 


1 


1018 






i 


% 


1 14381 


7404 


7404 


i 


i 


i 


9996 


1 47831 


i 
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Top Hit Descriptor 




AV727B09 HTC Homo sapiens cDNA clone HTCCNC12 5' 1 


Homo sapiens Xq pseudoautosomal region; segment 1/2 1 


yd15e05.r1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:108320 5' | 


Ml4c10,r1 Scares placenta Nb2HP Homo sapiens cDNA done IMAGE:1 48722 5' | 


Human G2 protein mRNA, partial cds | 


Human G2 protein mRNA, partial cds | 


Mus musculus DAB/2J hair-specific (hacl-1 ) gene | 


Homo sapiens Npw38-binding protein NpwBP (LOC51 729), mRNA 1 


Human splicing factor SRp55-1 (SRp-55) mRNA, complete cds | 


Rattus norvegicus putative four repeat ion channel mRNA, complete cds 1 


Homo sapiens mRNA for KI/WM435 protein, partial cds 1 


Homo sapiens chromosome 21 segment HS21C009 | 


tt94cC6.x1 NCLCGAP_Pr28 Homo sapiens cDNA clone I1VIAGE:2249194 3' | 


Homo sapiens hypothetical protein FLJ10989 (FLJ10989), mRNA 1 


601458531F1 NIH MGC 66 Homo sapiens oDNA clone IMAGE:3862086 5' | 


ADP,ATP CARRIER PROTEIN, LIVER ISOFORM T2 (ADP/ATP TRANSLOCASE 3) (ADENINE 1 
NUCLEOTIDE TRANSLOCATOR 3) (ANT 3) 


H omo sapiens X-linked anhidtoif c ectodemial dysplasia protein gene (EDA), exon 2 and flanking repeat 1 


1 


Homo sapiens WNT3 precursor (WNT3) mRNA, completecds | 


Homo sapiens WNT3 precursor (WNT3) mRNA, complete cds 


RC2-BT0506-2404CO-016-h08 BT0506 Homo sapiens cDNA | 


MRO-CT006S-280999-002-d1 1 CTC068 Homo sapiens cDNA 1 


601484430F1 N1H_MGC_69 Homo sapiens oDNA clone IMAGE:3886999 5' 1 


601484430F1 NIH_MGC_6g Homo sapiens cDNA clone IMAGE:3886999 5' 


CM4-HT0193-C61099-022-g06 HT0193 Homo sapiens cDNA | 


DKFZp564A1 553_r1 564 (synonym: hfbr2) Homo sapiens cDNA clone DKFZp564A1 563 5' | 


Homo sapiens chromosome 21 segment HS21C010 | 


hh77b06.y1 NCLCGAP_GU1 Homo sapiens cDNA clone IMAGE:2968787 5' 1 


Homo sapiens prohibitin (PHB) mRNA 1 


naa33a08.x1 NCI CGAP Kidi 1 Homo sapiens cDNA clone IMAGE;32581 34 3' similar to TR;07591 2 1 
075912 DIACYLGLYCEROL KINASE IOTA. ; . | 


Top Hit 
Database 




EST HUMAN 1 




EST HUMAN | 


EST HUMAN | 


z 










z 






EST HUMAN 1 


z 


EST HUMAN | 


SWISSPROT 


z 


z 




EST HUMAN | 


EST HUMAN 1 


EST_HUMAN 1 


EST HUMAN 


EST HUMAN | 




z 


1 


z 


EST_HUMAN | 


TopHHAcession 
Na 


1 

< 


AV727809.1 1 


271735.1 1 


1 


i 


§ 


i 




1 




i 

£ 
< 


AB037856.1 1 


AL1 63209.2 | 


1 
§ 
< 




BF035327.1 | 






< 




i 

1 


BE071414.1 1 


i 

i 
1 


BE874052.1 1 


BE874052.1 1 


AW368451.1 1 


AL03SS35.1 1 


AL1 6321 0.2 1 




60311901 














Most Similar 
(Top) Hit 
BIJlSTE 
Value 


1.0E-33 


1.0E-33| 


9.0E-34| 


7.0E-34 


7.0E-34 




6.0E-34I 


6.0E-34| 




5.0E-34I 


5.0E-34I 


2 


5.0E-34I 


4.0E-34i 


4.0E^4| 


3.0E-34I 


2 


I.OE-34I 


2 


1.0E-34| 


1.0E-34| 




1.0E-34| 


1.0E^4 


2 


1.0E-34I 


1.0E-34! 


9.OE-35I 


8.0E-35| 




Expression 
Signal 


i 




§ 
























i 














6 








§ 

CO 


S 




24.251 




ORFSEQ 
ID NO: 








21198] 








253401 




246311 


















i 


8 












276581 




i 




21475I 


Exon 
SEQ ID 
NO; 


i 


s 


1 


i 




i 


§ 






§ 




7937 


1 


1 




1 


i 


S 


1 


i 


4277 




1 


i 


1 


7442 


9307 


° 


i 


i 


























































Probe 
SEQ ID 
NO: 






i 


1427 


9340 


1 






i 


1 


71441 


1 


8578 


1 


26871 


i 


1 


i 


1 


§ 


i 


i 


i 


i 


i 


i 











301 




302 



wo 01/57274 PCT/USOl/00666 





304 



wo 01/57274 PCT/USOl/00666 



Top Hit Descriptor 


1 
E 

1 

1 
1 

8 
1 

S 
i 

X 


PM3-BN0176-100400-001-g04 BN01 75 Homo sapiens oDNA | 


RETROVIRUS-RELATED POL POLYPROTEIN [CONTAINS: REVERSE TRANSCRIPTASE ; 1 
ENDONUCLEASE] 


601298574F1 NIH MGC 19 Homo sapiens cDNA clone 1MAGE:3628386 5' | 


§ 
1 

a 

o 

o 
z 

■i 
i 


601282266F1 NIH_MGC_44 Homo sapiens cDNA clone IMAGE:3604168 5' 1 


601282266F1 NIH MGC 44 Homo sapiens cDNA clone IMAGE:36041 68 5' | 




Homo sapiens DNA for amyloid precursor protein, complete cds | 


Homo sapiens DNA for amyloid precursor protein, complete cds | 


ZU69C1 0.n Soares_testis_NHT Homo sapiens cDNA clone 1MAGE:743250 5' | 




AV753629 TP Homo sapiens cDNA clone TPGABH01 5' 1 


Homo sapiens neurexin ill-alpha gene, partial cds | 


Homo sapiens KIAA0952 protein (KIAA0962), mRNA | 


Mus musculus junctophilin 1 (Jpl-pending), mRNA 1 


601458531 F1 NIH_MGC_66 Homo sapiens cDNA clone 1MAGE:3862086 5' | 




< 

O 
1 

9 
a 


Mus musculus p47-phox gene, complete cds |, 


EST06648 Infant Brain, Bento Scares Homo sapiens cDNA clone HI BB J28 5' end f 


yc44a07.r1 Stratagene liver (#937224) Homo sapiens cDNA clone IMAGE:83508 5' \ 


601300938F1 N1H_MGC_21 Homo sapiens cDNA clone IMAGE.3635480 5' [ 


RC1 -HT021 7-1 31 1 99-021 -h07 HT021 7 Homo sapiens cDNA [ 


RC1-HT0217-131 199-021-h07 HT0217 Homo sapiens cDNA | 


6021 35493F1 NIH_MGC_83 Homo sapiens cDNA clone IMAGE:4272886 5' 


Wb37c12.x1 NCI CGAP GC5 Homo sapiens cDNA clone IMAGE:2307862 3' similar to contains Alu II 
repetitive element; | 


zo51 ai 2.r1 Stratagene endothelial cell 937223 Homo sapiens cDNA clone IMAGE:590398 5' | 


zo51 ai 2.r1 Stratagene endothelial cell 937223 Homo sapiens cDNA clone IMAGE:590398 5' 


QV3-NNl023-010600-199-h01 NN1023 Homo sapiens cDNA t 


Top Hit 
Database 
Source 




EST HUMAN | 


SWISSPROT 


EST HUMAN | 


EST HUMAN | 


1 


EST HUMAN | 






z 


EST HUMAN | 




EST HUMAN 1 






z 


EST HUMAN | 


EST HUMAN | 


EST HUMAN 1 




EST HUMAN 1 


EST HUMAN | 


EST HUMAN 1 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN | 


EST HUMAN 1 


EST HUMAN | 


TopHitAcession 
No. 


114178621 


BE01 0038.1 1 


P10266 


BE382574.1 ] 


1 


BE389299.1 1 


i 

i 


" 1 


D87675.1 1 


D87675.1 1 


8 

1 


114205161 


AV753629.1 1 


§ 
< 


1 


1 


1 


BE259267.1 | 


1 
1 


AF267747.1 | 


T08756.1 1 


1 


BE409310.1 1 


BE146523.1 1 




BF673761.1 1 


A1867714.1 1 


i 


AA1 48034.1 j 


1 i' 
I i 


Most Similar 
(Top) Hit 
BLAST E 
Value 


5.0E-36I 


4.0E-36I 




4.0E-361 


3 




4.0E-36I 




4.0E-36 


4.0E-35| 


4.0E-36| 


4.0E-36I 


4.0E-35I 


3.0E-36j 


3.0E-36I 


1 




2.0E-36I 


2.0E-36I 


^ 


2.0E-36I 


2.0E-36I 


OE-36 


.OE-36 


3 




OE-36 


1 






















g 

11 
|-« 








^ 


















s 
























d 




s 




s 1 




ORF SEQ 
ID NO: 


253371 


i 




21389 






23037 1 


s ; 


5 i 


27051 1 


1 








220331 






22857 1 


245341 




255561 


1 




1 


1 








s i 




Exon 
SEQ ID 
NO: 


9010 










3231 


3231 




I i 


















3057 




1 


5483 


1 




1 


1987 


1 


§ 


§ 


§ ! 


76741 






























































Probe 
SEQ ID 
NO: 


S 


1205 




1 


2175 


331 0| 


331 0| 


ii 


3 g 


69791 




9334 


9380 1 




2252 


44021 


1 


3132 


48771 


5371 


55671 


i 


S 








5752 


1 


11 


7824 1 
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1 
1 
: 

& 


GM3.NN0061-140400-147-h12 NN0061 Homo sapiens cDNA 


|UI-HF-BN0-ale-c-03-O-Ul.r1 NIH_MGC_50 Homo sapiens cDNA clone IMAGE:3079277 5' | 


IHomo sapiens Ran GTPase activating protein 1 (RANGAP1), mRNA 1 


1 Homo sapiens chromosome 22 open reading frame 2 (C220RF2), mRNA 1 


1 Homo sapiens chromosome 21 segment HS21 C01 3 | 


IHomo sapiens Sadi uno-84 domain protein 2 (SUN2) mRNA, partial cds 1 


ws80b07.x1 NCLCGAP_Co3 Homo sapiens cDNA clone IMAGE:2504245 3' 


ws80b07j<1 NCI CGAP Co3 Homo sapiens cDNA clone IMAGE:2504245 3' 


1 73D4 Human retina cDNA Tsp509l-cleaved sublibrary Homo sapiens cDNA not directional 1 


1 Homo sapiens mRNA for KIAA0877 protein, partial cds | 




ht09g01.x1 NCI_CGAP_KId13 Homo sapiens cDNA clone IMAGE:3146256 3' similar to contains MER29.b3 1 
MER29 repetitive element ; 


ht09g01 .x1 NCLCGAP_Kid13 Homo sapiens cDNA clone IMAGE:3146256 3' similar to contains MER29.b3 1 

MER29 repetitive element ; 


|RC1-CN0008-210100-012-a09_1 CN0008 Homo sapiens cDNA 1 


H.sap(ens DMA, DMB, HLA-Zl, IPP2, LMP2, TAP1, LMP7, TAP2, DOB, DQB2 and RING5, 9, 13 and 14 


1 DKFZp434EQ422_r1 434 (synonym: htes3) Homo sapiens cDN A clone DKFZp434E0422 5' | 


V»k25b11.x1 NCLCGAP_Brn25 Homo sapiens cDNA clone IMAGE:2413341 3' similar to contains PTR5.t2 1 
PTR5 repetitive element ; | 


tm87g03.x1 NCI_CGAP_Bm25 Homo sapiens cDNA clone IMAGE:21 55140 3' similar to contains LI .b3 LI 1, 
repetitive element ; 


Homo sapiens Sadi uno-84 domain protein 2 (SUN2) mRNA, partial cds |, 


EST1 78035 Colon carcinoma (HCC) cell line Homo sapiens cDNA 5' end | 


|EST178035 Colon carcinoma (HCC) cell line Homo sapiens cDNA 5' end 1 


< 
1 
g 
I 
% 

1 
1 

1 
i 

t 

0 
1 

1 


Homo sapiens NODI protein (NODI) gene, exons 1. 2, and 3 1 


0 
1 

0 

i 

.1 
1 

i 

X 

i 

1' 
1' 

i' 

s 


|EST52g10 WATM1 Homo sapiens cDNA clone 52g10 similar to human STS G04101 | 


DKFZp434L2418_r1 434 (synonym: htes3) Homo sapiens cDNA clone DKFZp434L2418 1 


|DKFZp434L2418_r1 434 (synonym: htes3) Homo sapiens cDNA clone DKFZp434L241 8 | 


EST373222 MAGE resequences, MAGF Homo sapiens cDNA | 


IEST373222 MAGE resequences, MAGF Homo sapiens cDNA | 


Top Hit 
Database 
Source 


lEST HUMAN 1 


|EST HUMAN ! 


|NT 


INT 


iNT 1 


NT 


|EST HUMAN | 


lEST HUMAN 


lEST HUMAN 1 


iNT 


EST HUMAN 


IeST HUMAN 


EST_HUMAN 


|EST HUMAN | 


z 


EST_HUMAN | 


EST HUMAN | 


EST HUMAN 




EST HUMAN | 


EST HUMAN 1 




z 


EST HUMAN 1 


EST HUMAN | 


EST_HUMAN 1 


EST HUMAN | 


EST HUMAN | 


EST_HUMAN | 


1 
1 




i 
1 


|AW 5041 43.1 1 




11418121 1 


|AL1 6321 3.2 | 


1 


tAW009277.1 1 


1AW009277.1 1 


|W22618.1 1 


< 


1 


BE350127.1 1 


BE350127.1 


|AW840840.1 1 


X87344.1 




AI81 7700.1 1 


AI535702.1 


AF202723.1 | 


AA307123.1 1 


AA307123.1 1 


75571171 


AF149773.1 | 


AA702794.1 1 


N62051.1 1 


1 


8 

1 


AW961 150.1 1 


i 


IWost Similar 
(Top) Hit 


BLAST E 
Value 


1 .OE-36 


l.0E-36| 


1.0E-36| 


l.0E-36| 


1 .OE-36 


l.0E-36| 


1 9.0E-37| 


1 9.0E-37I 


1 9.0E-37I 


1 8.0E-37I 


8.0E-37I 


8.OE-37I 


8.OE-37I 


8.0E-37I 


8.0E-37 


7.0E-37I 


S 


7.0E-37 


6.0E-37I 


5.0E-37I 


5.0E-37I 


S 


i 


4.0E-37I 


4.0E-37I 


S 


3.0E-37I 


3.0E-37I 
















1 


s 

1 




1 3.91 1 


1 3-74 1 






















i 








4.16 






3.37| 






2.23 1 




s 


2.85 1 






ORFSEQ 


ID NO: 














s 


1 














1 






i 






2571 7| 






221451 






217561 






Exon 
SEQ ID 
NO: 


S 


18574 




1 








16231 


1 


s 




s 


s 




i 








1 




5615 




i 


12254 


S 


i 


1 


12341 






























































Probe 
SEQ ID 
NO: 






1 9203j 


1 9374| 


s 


g 






1 94621 


1 


5213 




§ 


5584 


i 


1263 


1 8I40I 


s 


1 


5707 


1 5707| 


^8292 










1 19701 


S 
Si 


i 
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Top Hit Descriptor 




Homo sapiens mRNA for KIAA1244 protein, partial cds 1 


Homo sapiens ubiquitin specific protease 13 (isopepfidase T-3) (USP13) mRNA | 




7H1 5A04 Chromosome 7 HeLa cDNA Library Homo sapiens cDNA clone 7H1 5A04 1 


601 288958F1 NIH MGC 8 Homo sapiens cDNA clone IMAGE:361 91 56 5' | 






Homo sapiens chromosome 21 segment HS21 C046 | 




EST70527 T-cell lymphoma Homo sapiens cDNA 5' end similar to similar to zinc finger protein family 1 


hz40g01.x1 NCLCGAP_GC6 Homo sapiens cDNA clone IMAGE:3210480 3' 


Homo sapiens fatty acid desaturase 1 (FADS1 ), mRNA | 


Homo sapiens fatty acid desafurase 1 (FADS1 ), mRNA | 


AV653028 GLC Homo sapiens cDNA clone GLCDGF04 3' | 




Homo sapiens chromosome 21 segment HS21 C085 | 


POL PROTEIN. ; | 


Homo sapiens X-linked anhidroific ectodermal dysplasia protein gene (EDA), exon 2 and flanking repeat 1 
regions 


Homo sapiens KIAA0433 protein (KIAA0433), mRNA | 




PM0-BN0167-070500-002-h12 BN0167 Homo sapiens cDNA 1 




RC1-CN001 7-1 20200-0 12-e04 CN0017 Homo sapiens cDNA | 


Wh12f07.x1 NCI CGAP Kidll Homo sapiens cDNA clone IMAGE:2380549 3' 1 


(TM) and short cytoplasmic domain, (semaphorin) 5A (SEMA5A), mRNA | 


Top Hit 


Database 
Source 


m 


NT 


NT 1 


NT 1 


EST HUMAN 1 


EST HUMAN 


NT 


MX 


NT 1 


EST HUMAN | 


EST HUMAN 


EST HUMAN 1 


NT 1 


NT 1 


EST HUMAN 


EST HUMAN | 




EST HUMAN | 






EST HUMAN | 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN | 


EST HUMAN | 


NT 1 


. 

I 


I 


45O75I2I 


AB033070.1 1 


1 
1 


45078481 


AA078165.1 1 


BE396541.1 | 


U60325.1 


i 


AL1 63246.2 | 


AA361 275.1 | 


AA361 275.1 


BE504766.1 1 




g 


AV653028.1 | 


AV653028.1 | 


AL1 63285.2 | 


i 

< 


AF003528.1 


1 




1 


BE009416.1 1 


AW841585.1 1 


AI925949.1 | 


1 141 7342 1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


9.0E-40 


9.OE-40I 




9.0E-40I 


8.0E-40 


? 


7.0E-40 


1 


7.OE-40I 


6.0E-40 


6.OE-40I 


6.OE-40I 


6.OE-40I 


6.0E-40I 


6.0E-40I 


6.OE-40I 


t 




4.0E-40 


f 


4.OE-40I 




4.0E-40 


4.OE-40I 


3.OE-4OI 


3.OE-40I 










d 




0.95 1 








2.83 1 






2.071 


















3.761 




4.87 


4.071 


1 


6.251 


ORFSEQ 
ID NO: 


212051 


23586 1 












1 




















1 






26738] 


§ 


1 






1 


Exon 
SEQ ID 
NO: 


11338 


1 










i 


s 


18150 


1256o| 


§ 


155311 




§ 


i 








11951 




16542| 






179891 




i 


Probe 
SEQ iD 
NO: 


1433 




4250 


1 


30041 


3847 


1 


i 


82701 


1 


1 


1 


61411 


5141 1 


7731 1 




1 


,j 


2061 


4291 


i 


72371 


7237 




4040 


6001 1 
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Top Hit Descriptor 


EST367438 MAGE resequences, MAGC Homo sapiens cDNA 


I601061284F1 NIH MGCJO Homo sapiens cDNA clone IMAGE:3447620 5' | 


1 Human endogenous retrovirus pHE.1 (ERV9) | 


|U|.H-B11-afh-^04-0-Ul.s1 NCI_CGAP_Sub3 Homo sapiens cDNA clone IMAGE:2721871 3' | 


Homo sapiens partial C9 gene for complement component C9, exon 1 1 


Homo sapiens NADH-ubiquinone oxidoreductase AGGG subunit precursor liomolog mRNA, nuclear gene 


Homo sapiens NADH-ubiquinone oxidoreductase AGGG subunit precursor homolog mRNA, nuclear gene 1 
1 encoding mitochondrial protein, complete cds | 


iHomo sapiens rec (LOC51201), mRNA | 


Homo sapiens PDNP1 gene, exon 17 | 


Homo sapiens major histocompatibility complex, class II, DM alpha (HLA-DMA) mRNA | 


Homo sapiens origin recognition complex, subunit 5 (yeast homolog) like (0RC5L) mRNA, and translated 1 
products 1 


Homo sapiens KIAA0255 gene product (KIAA0255), mRNA | 




1 Homo sapiens chromosome 21 segment HS21 C080 | 


|RC3-ST0197-161099-012-a03 ST0197 Homo sapiens cDNA | 


Homo sapiens proteasome inhibitor (PI31), mRNA f 




IHomo sapiens ryanodine receptor 3 (RYR3) mRNA 1 


IHomo sapiens mRNA for KIAA1288 protein, partial cds ( 








|zt79a07.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:728532 3' | 


IAV736824 CB Homo sapiens cDNA clone CBLAKH08 5' t 


|AV736824 CB Homo sapiens cDNA clone CBLAKH08 5' [ 




1 Homo sapiens hypothetical protein FU20297 (FU20297), mRNA f 


1 Homo sapiens hypothetical protein FU20297 (FU20297), mRNA \ 


]2822251.5prime NIH MGC 7 Homo sapiens cDNA clone IMAGE:2822251 5' [ 


Top Hit 
Database 
Source 


|EST_HUMAN 


lEST HUMAN 


z 


|EST_HUMAN 


z : 


E z 










z 


z 


z 




lEST HUMAN | 


1- 




1- 


1- 




z 


z 


lEST HUMAN | 


|EST HUMAN | 










|EST HUMAN j 


TopHitAcession 
No. 


|AW955368.1 1 


IBE538919.1 | 




|AW295809.1 | 


AJ251818.1 1 


i i 
I i 


AF067166.1 1 


11423219] 


lAF1 10296.1 1 


1 51 74458 1 


1 


E 


|AL1 63267.2 | 


i 

s 


IAW813617.1 1 


1 58031221 


i 


1 


IAB033114.1 1 


i 


i 


1 47579691 


IAA435719.1 | 


IAV73B824.1 | 


IAV736824.1 | 


1 




1 


|AW246442.1 ] 


1 UJ 


t 


2.0E-42I 










I.OE-42I 


t 


t 


I.OE-42I 




1.0E-42I 








l.0E-42| 




t 








3.0E-43I 


3.0E-''r3| 


5.CE-43I 


5.0E-43I 




3.0E-43 


3.0E-43I 


r.0E-43| 






























































§ 

11 


1 10.2| 






2 






















i 


1 2.851 








i 




1 3.891 


s 




1 12.13] 






1 4.33] 




ORF SEQ 
ID NO: 


1 255011 




R 


i 


20842 


I 1 


i 


1 




22264] 




23347| 


23553 


1 


241 70| 


243161 






244421 


i 


i 


27893 






20387] 


1 20441] 




I 20443] 


% 


= s .. 

S CjO 


15437 




S 


1 


11001 




i 






12372 


i 


13561 


13760 




14383 


14528 




s 


i 


1 


§ 


17655 


18273 




10573 








13503 






























































Probe 
SEQID 
NO: 


55191 






10261 


§ i 


I 1 


.J 


1673 


1 


1 24971 


1 


1 3647| 


1 


s 


1 


1 


4640 


1 


1 


•5048 


5048 


7805 


1 




2 




i 


i 


1 



316 



wo 01/57274 PCT/USOl/00666 




317 



/;/E:A^¥O01 57274 cpc; 

WO 01/57274 



PCT/USOl/00666 



1^ 



f| 8 



318 



wo 01/57274 PCT/USOl/00666 




319 



wo 01/57274 PCT/USOl/00666 



Top Hit Descriptor 


Homo sapiens adaptor-related protein complex 4, sigma 1 subunit (CLAPS4), mRNA | 


|Homo sapiens DNA for amyloid precursor protein, complete cds | 


|PM4-SN0016-120500-003-a04 SN0016 Homo sapiens cDNA | 


Homo sapiens chemol<lne (C-C motif) receptor 9 (CCR9), mRNA | 




§ 
1 
I- 

8 
s 
[ 

5 
i 
I 
I 
J 

I 

I 
1 

i 

1 

i" 

; 
c 


Homo sapiens glutamate receptor, metabotropic 3 (GRM3], mRNA 1 


501286914F' NIH MGC 44 Homo sapiens cDNA clone IMAGE:3613586 5' | 


IcDNA clone TCBAP2795 | 


i Human mRNA for KIAA0376 gene, partial cds | 




[Homo sapiens Misshapen/NIK-related kinase (MINK), mRNA | 




RC1-CT0249-030300-026-ti1 2 CT0249 Homo sapiens cDNA 1 


1 RC1 -BN0039-1 1 0300-01 2-b01 BN0039 Homo sapiens cDNA j 


|Homo sapiens chromosome 21 segment HS21 C103 | 


z//53d02.r1 Soares_total_fetus_Nb2HF8_9w Homo sapiens cDNA clone IMAGE:773763 5' similar to 1 
contains THR.t3 THR repetitive element ; f 


|zw53d02.r1 Soare5_tofaLfetU5_Nb2HF8_9w Homo sapiens cDNA clone IMAGE:773763 5' similar to [ 
contains THR.t3 THR repetitive element ; [ 


1 

1 

i 

Q 

.1 
1 

i, 
1 

1 
f 


complete cds; and L-type calcium channel a> 


Iaa01c09.s1 Scares NhHMPu SI Homo sapiens oDNA clone IMAGE:811984 3' ] 


]Homo sapiens alpha satellite DNA, Ml monomer type ( 


]Homo sapiens alpha satellite DNA, Ml monomer type | 


]AV714608 DCB Homo sapiens cDNA clone DCB3YE03 5' | 


]Homo sapiens Sushi domain (SCR repeat) containing (BKS5A5.2), mRNA \ 


1 RC1 -CT01 98-1 50999-01 1 -COB CT01 98 Homo sapiens oDNA | 




]Homo sapiens hypothetical protein FLJ10379 (FU10379). mRNA ( 


Top Hit 


Source 


NT 1 


NT 1 


EST HUMAN 1 


NT 1 


NT 1 


NT 1 


EST HUMAN | 


EST HUMAN | 


NT 1 


INT 1 


INT 1 


|NT 1 


EST HUMAN | 


EST HUMAN ] 


iNT 1 


EST HUMAN 1 


EST HUMAN 


|EST HUMAN 1 




lEST HUMAN 


iNT 1 


iNT 1 


|EST HUMAN | 


iNT 1 


Iest human 


]EST HUMAN 


|NT 1 


TopHftAcession 


No. 


1 


D87675.1 1 


AW864379.1 


1 




< 


114192261 


1 


i 


AB002374.1 1 




7657334] 


76573341 


i 


IAW994803.1 ] 


|AL1 63303.2 ] 


1 


AA434554.1 1 


i 
1 


< 


i 


IAJ130755.1 ! 


]AJ1 30755.1 1 


IAV714608.1 1 






1 


% 

i 


Most Similar 
(Top) Hit 
BLAST E 
Value 


2.0E-44I 




I 


20E-44I 


2.OE-44I 


2.0E-44| 


2.0E-44 


2.0E-44I 


2.0E-44I 


2.0E-44I 


1.0E-44| 


1.0E-44I 


1.0E-44I 


3 


1.0E-44] 


I.OE-44I 


I.OE-44I 


i 


1.0E-44 


1 1.0E-44I 


? 


] 1.0E-441 


1.0E-44 


3 


] I.OE-44I 


] 1.0E-44| 


1 9.0E-45I 


Expression 


Signal 








1.391 


^ 




g § 




2.72 






3.64 






5.54 






1-05 














3.43 




1 1.74| 


ORF SEQ 
ID NO: 






24152 


257191 


24870 1 


1 






249101 




i 




20312] 






21965 






22489 




. 24702 


247031 




1 28918 


28970 


1 


1 241591 


Exon 




12438 


133421 




156171 


15107! 


16245 






19735 






8 


90901. 


§ 


I 


i 


12064 


12716] 


i 




1 149311 


14931 


s 


1 


i 


i 


1 14370| 


Probe 
SEQ ID 
NO: 


2567 


3425 


4468 


5709 


1 


i 


7009 




9710 








s 




1555 


J 


2177 


2237 


2732 


i 


1 


i 




i 




1 


1 
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Top Hit Descriptor 


601284360F1 NIH MGC 44 Homo sapiens cDNA clone IMAGE:360S1 83 5' | 


601284360F1 NIH_MGC_44 Homo sapiens oDNA clone IMAGE:360S1 83 5' | 


Homo sapiens RAP1A, member of RAS oncogene family (RAP1 A), mRNA | 


Homo sapiens Langerhans cell specific c-type lectin (LANGERIN), mRNA | 


Human pro-a2 chain of collagen type XI (COL11A2) gene, complete cds 1 




Homo sapiens mRNA for KIAA1591 protein, partial cds 1 


601289116F1 NIH_MGC_8 Homo sapiens cDNA clone IMAGE:3619803 5' | 




601 51 1226F1 NIH MGC 71 Homo sapiens cDNA clone IMAGE:3912535 5' | 


Human mRNA for KIAA0299 gene, partial cds | 


Homo sapiens protein kinase C, alpha binding protein (PRKCABP), mRNA I 


Homo sapiens hypothetical protein FLJ20454 (FLJ20454), mRNA | 


Homo sapiens Ran GTPase activating protein 1 (RANGAP1), mRNA | 


Homo sapiens calciurr. channel, voltage-dependent, alpha 1 1 subunit (CACNA1 1), mRNA | 


Mus muECuius keratin complex 2, gene 6g (Krt2-6g), mRNA 1 




2822449.5prime NIH MGC 7 Homo sapiens cDNA clone IMAGE:2822449 5' | 


ti32f08.x1 NCI_CGAP_Gas4 Homo sapiens cDNA clone IMAGE:2132199 3' similar to gb:J00314_ma2 1 
TUBULIN BETA-1 CHAIN (HUMAN); 


fi32f08.x1 NCI CGAP Gas4 Homo sapiens cDNA clone IMAGE:2132199 3' similar to gb:J00314 rna2 11 
TUBULIN BETA-1 CHAIN (HUMAN); I 


RC5-HT0506-280200-012-C12 HT0506 Homo sapiens cDNA [ 


601277292F1 NIH MGC 20 Homo sapiens oDNA clone IMAGE:3618119 5' [ 


RC4-BT0310-110300-015-f10 BT0310 Homo sapiens cDNA f 




601 822835F1 NIH MGC 77 Homo sapiens cDNA clone IMAGE:4042736 5' | 


Homo sapiens chromosome 21 segment HS21C046 |: 


f08.)d NCI CGAP Ut4 Homo sapiens cDNA done IMAGE:2437575 3' similar to contains MER1 9.t2 
9 repetitive element; | 


f08.x1 NCl_CGAP_Ut4 Homo sapiens cDNA clone IMAGE;2437575 3' similar to contains MER19.t2 li 
9 repetitive element; | 


IOjcINCI CGAP Kid8 Homo sapiens cDNA clone IMAGE:2232835 3' similar to TR:O60363 060363 1. 
ENE.; - - I 


IS 






Top Hit 
Database 


EST HUMAN ] 


EST HUMAN | 












EST HUMAN 1 




EST HUMAN | 
















EST HUMAN 1 


EST HUMAN 


EST HUMAN | 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN | 




EST HUMAN | 




EST HUMAN | 


EST HUMAN 


EST HUMAN | 


TopHitAcession 
No. 




BE389855.1 | 


4506412| 


1 

% 


U32169.1 1 


1 


i 
i 


BE396633.1 1 


115457961 




AB002297.1 | 


1 


115262911 




114181571 


1 
§ 


AL1 63209.2 | 


AW246964.1 | 


AI433261.1 


AI433261.1 1 


BE1 67244.1 1 


BE386165.1 | 




1 


BF105845.1 1 


AL1 63246.2 | 


i 

< 


AI884381.1 


AI635448.1 | 


Most Similar 
(Top) Hit 
BLAST E 
Value 


I 


.OE-45 


T 




.OE-45! 


l 


.OE-45, 


.OE-45 


.OE-45 


ui 


.0E-45| 


.0E-45| 


.OE-45 




.0E-45| 




9.0E-46I 


9.0E-46| 


8.0E-46 




8.0E-46I 




7.0E-46 


7.0E-46I 


7.0E-46I 


7.0E-46| 


6.0E-46 


6.0E-46| 


6.0E-46I 
































Expression 
Signal 




























^ 
































ORFSEQ 
ID NO: 






20227| 




227841 










273471 










25207] 


i 






22165 










256931 






22483 1 


22484I 




Exon 
SEQ ID 
NO: 


8 


1 


S 


1 




i 


1 


4274 












1 


9324 


1 


i 


















? 


1 


25881 






























































Probe 
SEQID 
NO: 






i 




lg 

8 


3447 


1 


4378| 




7274 


74851 


1 


94151 


1 


1 




70181 


1 


2392 


i 


i 


4478| 


i 


5683 




1 






5727I 



322 




323 



wo 01/57274 PCT/USOl/00666 




324 



wo 01/57274 PCT/USOl/00666 




325 



wo 01/57274 PCT/USOl/00666 




326 



wo 01/57274 PCT/USOl/00666 




wo 01/57274 



PCT/USOl/00666 



^ < s 



i| 
II, 

.1 
I 



III 



328 



wo 01/57274 



PCT/USOl/00666 



11^ 

^^1 



11 



si 



-i = iLi 



si 



329 



wo 01/57274 PCT/USOl/00666 




330 
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Top Hit Descriptor 


Homo sapiens MHO class 1 region | 


Homo sapiens midline 1 (Opilz/BBB syndrome) (MIDI) mRNA j 


Homo sapiens decorin D mRNA. complete cds, alternatively spliced | 




i 

1 

1 


Mus musculus keratin complex 2, gene 6g (Krt2-6g), mRNA | 


Mus musculus keratin complex 2, gene 6g (Krt2-6g), mRNA | 


PM3-BN0137-290300-002-g11 BN0137 Homo sapiens oDNA | 


PM3-BN0137-290300.002-g11 BN0137 Homo sapiens cDNA | 






A51 c09.r1 Soares_pregnant_uterus_NbHPU Homo sapiens cDNA clone IMAGE:486352 5' | 


np98e09.El NCLCGAP Lul Homo sapiens cDNA clone IMAGE:1142440 3' slmllarto gb:X12a71_rna1 1 
HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN AI (HUMAN); | 


Homo sapiens PDZ-73 protein (PDZ-73/NY-CO-38), mRNA | 


AU138590 PLACE1 Homo sapiens cDNA clone PLACE1008887 5' | 


QV4-NT0028-200400-180 d05 NT0028 Homo sapiens cDNA | 


Ixn34a03.x1 NCI CGAP Kidi 1 Homo sapiens cDNA clone IMAGE:2695564 3' similar to TR:Q9Z340 1 
Q9Z340 ATYPICAL PKC SPECIFIC BINDING PROTEIN. ; [ 


DKFZp434B2229 r1 434 (synonym: htes3) Homo sapiens cDNA clone DKFZp434B2229 5' | 


DKFZp434B2229_r1 434 (synonym: htes3) Homo sapiens cDNA clone DKFZp434B2229 5' |i 






Homo sapiens KI/\A0929 protein Msx2 interacting nuclea- target (MINT) homolog (KIAA0929), mRNA [' 




1 Homo sapiens solute earner family 2 (facilitated glucose transporter), member 9 (SLC2A9), mRNA |; 


i 
1 

f 

1 
1 


Homo sapiens mitogen-activated protein kinase kinase 1 (MKK4) gene, exon 4 | 


Homo sapiens mitogen-activated protein kinase kinase 1 (MKK4) gene, exon 4 | 


Homo sapiens cerebral cell adhesion molecule (LOC51 148), mRNA | 


Homo sapiens B9 protein (B9), mRNA ^ 


Homo sapiens interleuWn 17 receptor (IL17R), mRNA |r 


Top Hit 
Database 
Source 




z 


z 










|EST HUMAN | 


|EST HUMAN | 


z 


z 


I 


EST HUMAN | 


z 


jEST HUMAN | 


lEST HUMAN 1 


EST_HUMAN | 


lEST HUMAN | 


lEST HUMAN 1 


|EST HUMAN | 






z 
















Top HitAcession 
No. 


IAF055066.1 | 


1 4557752] 




i 


i 


1 


1 

i 


IBE007080.1 1 


|3E0D7080.1 1 


i 


IAJ271735.1 1 


1 


I/1A610842.1 1 


1 11439587] 


i 


AW889219.1 


i 
1 


i 
g 


IAL079628.1 1 


|AW2956C3.1 | 




1 76572561 


1 99105531 


1 9910553] 


X01788.1 




|AF070083.1 1 


1 11429665] 


1 7661535] 


s 
s 


Most Similar 
(Top) Hit 

Value 


1 2.0E-50 


1 2.0E-50 


s 




1 2.0E-50 


1 2.0E-50 




1.0E-50 


1.0E-50 


s 


1 1.0E-50 




8.0E-51 


i 


t 


1 7.0E-51 


7.0E-51 


1 7.0E-51 


7.0E-51 


1 7.0E-51 


§ 


i 


1 6.0E-51 


] 6.0E-51 


6.0E-51 


§ 


] 6.0E-51 


1 6.0E-51 


1 6.0E-51 


i 


1 1 

r 




















































1 










ORFSEQ 
ID NO: 












1 277611 


1 




200191 






27450 


241 51 1 










237571 


23758 




21714I 


2315o| 


8 


i 


1 25651] 






24859 


1 276491 


28716] 


Exon 
SEQ ID 
NO: 


i 




11330 


i 


i 






i 


10204 




12195 




14361 


i 




13168 


13238 


I3S78 


13978 






13345 






15558 








174341 
































































Probe 
SEQ ID 
NO: 




1 10631 


1425 




i 


1 


1 








2314 


7375 


J 


i 


1 74481 


3245 


1 3317! 




1 


4254 




J 


4212 


4212 


i 


i 


1 


s 


1 7583] 


§ 



331 




332 



wo 01/57274 PCT/USOl/00666 




wo 01/57274 PCT/USOl/00666 




334 



wo 01/57274 



PCT/USOl/00666 




335 



wo 01/57274 PCT/USOl/00666 




336 



wo 01/57274 PCT/USOl/00666 




337 



wo 01/57274 PCT/USOl/00666 




338 
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339 
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Top Hit Descriptor 


|RC1-CT0252-231099-013-b07 CT0252Homo sapiens cDNA | 


|RC3-BN0053-170200-011-h01 BN0053 Homo sapiens cDNA | 


|43c5 Human retina cDNA randomly primed sublibrary Homo sapiens cDNA | 


1 OH R220038 Ctiromosome 22 exon Homo sapiens cDNA clone 022 55 5' | 


[Homo sapiens beta-tubulin mRNA, complete cds | 


Homo sapiens beta-tubulin mRNA, complete cds | 


j Homo sapiens tubulin, beta polypeptide (TUBB) mRNA | 


IHomo sapiens tubulin, beta polypeptide (TUBB) mRNA 1 


regions | 


1 Homo sapiens unctiaracterized bone marrow protein BM031 mRNA, complete cds | 


Homo sapiens uncharacterized bone marrow protein BM031 mRNA, complete cds | 


|Homo sapiens lymphocyte-specific protein 1 (LSP1 ) gene, LSP1-7 allele, partial cds | 


|tm65g12.x1 NCI_CGAP_Brn25 Homo sapiens cDNA clone IMAGE:2163046 3' | 


Itm65g12.xl NCI_CGAP_Brn25 Homo sapiens cDNA clone IMAGE.21B3046 3' | 


Homo sapiens hypothetical protein PRO1304 (PRO1304), mRNA | 


Homo sapiens oncogene TC21 (TC21), mRNA | 


j EST28889 Cerebellum 1 1 Homo sapiens cDNA 5' end | 


E3T28889 Cerebellum II Homo sapiens cDNA 5' end | 


Homo sapiens MHC class 1 region | 




Homo sapiens superkiller viralicidic activity 2 (S. cerevisiae homolog)-like (SKIV2L), mRNA | 




1 


Homo sapiens lysosomal-associated membrane protein 2 (LAMP2), mRNA | 


Homo sapiens bone morphogenetic protein 5 (BMP5), mRNA | 


Homo sapiens KIAA0317 gene product (KIAA03 17), mRNA | 


Homo sapiens mRNA, similar to rat myomegalin, complete cds | 


Homo sapiens nuclear pore complex interacting protein (NPIP), mRNA | 


Homo sapiens nuclear pore complex interacting protein (NPIP), mRNA 1 


Homo sapiens caveolin 3 (CAV3), mRNA | 


1 

E 
.1 

! 
i 




z 

5 


2 

$ 


z 

5 


z 
5 




















1 








1 




























Topi 
Databi 
Sour, 




lEST HU^ 






z 








z 


z 




Z 


|EST HU^ 


|EST HU^ 






lEST HU^ 


|EST HUK 




























TopHitAoession 
No. 


|AW361213.1 


IAW997712.1 1 


1 


I 


IAF141349.1 1 


IAF141349.1 1 


1 


1 45077281 


AF003528.1 | 


|AF21 7508.1 1 


IAF21 7508.1 1 


IAF043349.1 | 


< 


IAI4980B6.1 1 


1 

s 


1 

s 


|AA325826.1 | 


|AA325826.1 | 


IAF055066.1 | 


|AL1 53268.2 | 


1 
§ 




47591 63 1 




i 


1 


1 


59020131 


§ 


S 




Most Similar 
(Top) Hit 
Bl-ASTE 
Value 


1 7.0E-56I 


1 5.0E-56I 




s 


1 4.0E-S6I 


1 4.0E-56I 






1 4.0E-56I 




i 




1 4.0E-56I 


i 


3.0E-5e| 


! 3.0E-56I 


1 3.0E-56| 


3.0E-56| 


3.0E-56I 




I 3.0E-S6I 


3.0E-56I 


3.0E-56I 




3.CE-56I 


3.0E-56I 


3.0E-56I 


3.0E-5e| 


3.0E-56I 


3.0E-56I 


3.0E-56I 




? 


oi 




§. 
















^ 








































OFIFSEQ 
ID NO: 


1 255391 


1 21434| 






i 


1 




1 22432] 


% 


258021 


i 




i 






218921 


?^ 


22805| 










254681 




271851 






28771 1 




25315| 


25316| 


c S .. 

Sag 




1 


17790 






10012 




12541 


1 


15694 




1 












13013 


13674 




S 




1 


i 


i 






i 


1 


18955 


S 


































































Probe 
SEQ ID 
NO: 


6504 


1 










s 


1 




5788 


5788 


1 




1 


1318 


2103 


S 




3761 


s 


4470 


1 




g 

E 


7117 


7987 


1 8124| 




8632 




1 



341 



wo 01/57274 
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1 



342 



/;/E:A^¥O01 57274 cpc; 

WO 01/57274 



PCT/USOl/00666 



11 
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Top Hit Descriptor 




1 6-b05 NT0057 Homo sapiens cDNA 


0 ? 

1 i 
1 i 

i I 

ll 


27-e07UM0043 Homo sapiens cDNA | 


s 

1 

! 

1 

i 
1 


3 

i 

1 
i 

1 
1 

1 

3 

z 


Homo sapiens placenta-specific 1 (PLAC1 ), mRNA | 


ym51h07.r1 Soares infant brain 1NIB Homo sapiens cDNA clone IMAGE:520i'1 5' | 


Homo sapiens apical protein, Xenopus lae\;rs-like (APXL), mRNA 1 


Homo sapiens hypulhetical protein FLJ10825 (FLJ10826), mRNA | 


Homo sapiens cliromosome 21 segment HS21 C01 8 | 




Homo sapiens Ran GTPase activating protein 1 (RANGAP1 ), mRNA | 


Homo sapiens ATP synthase, H+ transporting, mitochondrial F1 complex, 0 subunit (oligomycin sensitivity 1 
conferring protein) (ATP50) mRNA 


Homo sapiens interleukin 10 receptor, beta (1L10RB), mRNA 


(F9)mRNA | 


Homo sapiens ubiquitin-conjugating BIR-domam enzyme APOLLON mRNA, complete cds | 


i 

i 
1 
1 

i 
1 

E 
X 


[Human mRNA, Xq terminal portion | 


I Homo sapiens EGF-like repeats and discoidin Mike domains 3 (EDIL3), mRNA | 


iHomo sapiens E1B-55kDa-associated protsin 5 (E1B-AP5), mRNA 


|yg10e02.r1 Soares infant brain 1N1B Homo sapiens cDNA clone IMAGE:31693 5' 1 


1 Homo sapiens peptide YY (PYY) mRNA 1 


I602185789F1 NIH_MGC_45 Homo sapiens oDNA clone IMAGE:4309943 5' 


I 

1 
g 
< 

D 

i 

1 
1 

O 

i 

s 


|AV712977 DCA Homo sapiens cDNA clone DCAAZG04 5' 1 


1 Homo sapiens 5-aminoievulinate synthase 2 (ALAS2) gene, complete cds 1 


ba08b07.y1 NIH MGC 7 Homo sapiens oDNA clone 1MAGE:2823733 5' similar to gb:X69391 60S 
RIBOSOMAL PROTEIN LB (HUMAN); gb:X81987 M.musculus mRNA for TAX responsive element binding 
protein (MOUSE); 


|501499961F1 N1H_MGC_70 Homo sapiens cDNA clone 1MAGE:3901911 5' | 


§ 

3 


CM3-UM0043-240300-1 


1 

s 

I? 

i 
o 


CM3-UM0043-240300-1 


Top Hit 
Database 
Source 


NT 


2 
1 


1; 


EST HUf/AN 1 


Z 


EST HUmN 1 






iNT 1 




NT 


iNT 1 


z 


k 


iNT 1 


NT 


NT 


NT 


iNT 1 


NT 




lEST HUMAN ] 


Int 1 




X 


lEST HUMAN 


z 


EST HUMAN 


lEST HUMAN 


TopHitAcession 
No. 


4507334) 


BE763984.1 1 


AW797948.1 1 


I 1 
?l 


i 


1 
1 




1 
I 


1 11421330] 




AL1 6321 8.2 


1 115262931 


11418177 


1 4502302 1 


1 45046341 


4503648 


1 
< 


|U36251,1 1 


ID16470.1 1 


i 

i 


1 114240591 


|R1 7879.1 


i 


i 




|AV71 2977.1 






IBE907186.1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


5.0E-58 


5.0E.58 


5.0E-58 


5.0E.58 




5.0E-S8 


5.0E-58 




1 5.0E-58I 




5.0E-58 


5.CE-58 




1 4,0E-58| 


4.0E-58 


4.0E-58 


1 4.0E-58I 


4.0E-58 


1 4.0E-58I 


1 4.0E-581 


4.0E.58 


3.0E-5a 


1 3.0E-58I 


i 


3.0E-58 


1 3.0E-58 


2.0E-58 


2.0E-58 


1 2.0E-58 


Expression 
Sign^ 






! 




2.99 




1 2,21 


i 




6.771 


^ 


3,261 




17,971 






























1 4.42 


ORF SEQ 
ID NO: 


1 


20451 


20932 


20932 


i 






1 


25924 


26721 








20147 


20548 


21221 


22297 




23004 


233801 


1 




21129 






i 








Exon 
SEQ ID 
NO: 


10261 


I06?6 




8 § 




13202 


15378 




1 


16527 




§ 


19362 


10325 


% 


11357 




12461 


13204 


13594 




10291 


11273 


1 


i 








i 




























































Probe 
SEQID 
NO: 




i 








3281 




5748 


i 


i 


r787P 


9215 




i 




1452 






3283 


1 


8651 






r3ui 


3141 


1 




i 
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Top Hit Descriptor 


1 

1 

1 
< 

i 
1 

CD 
5 

1 
1 


am57e02.x1 Johnston frontal cortex Homo sapiens cDNA clone IMAGE;1 539674 3' similar to WP:ZK328.1 
CE05065 UBIQUITIN CONJUGATING ENZyMEl; RECOVERIN SUBFAMILY OF EF-HAND CALCIUM 
BINDING PROTEIN ; 




|Homo sapiens endocytic receptor EndolSO (ENDO180) mRNA, complete cds | 


601890812F1 NIH_MGC_1 7 Homo sapiens cDNA clone 1MAGE:41 31891 5' 


i 

o 

i 

o 

< 
z 

■i 
f 

1 

o 
o 

1 

E 


Human complement component 05 mRNA, 3'end | 




< 
1 

1 

i 
I 
I 

\ 


EST369252 MAGE resequences, MAGD Homo sapiens cDNA | 


1 EST369252 MAGE resequences, MAGD Homo sapiens cDNA | 


1 Homo sapiens partial AF-4 gene, exons 2 to 7 and Alu repeat elements | 


S 
CD 

i 

1 
1 

f 
i 

8 
8 

o 
z 

1 
1 




1 Homo sapiens chondroitin sulfate proteoglvoan 2 (versican) (CSPG2) mRNA | 


1 

1 
1 
1 

f 

1 
1 

I 

1 
I 


1 Homo sapiens transition protein 1 (during histpne to protamine replacement) (TNP1 ) mRNA | 


1 Human prohormone converting enzyme (NEC2) gene, exon 4 | 


|oz43h01,x1 Soares_NhHMPu_S1 Homo sapiens cDNA clone IMAGE:1678129 3' . | 






[Homo sapiens TATA box binding protein (TBP) mRNA | 


whSOdOe.xl NCI_CGAP_Kid1 1 Homo sapiens cDNA clone IMAGE:2384171 3' 1 


I601458531F1 NIH_MGC_66 Homo sapiens cDNA clone IMAGE:3862086 5' | 


o 

1 
1 

i 
.1 

i 
1 

1 
% 


H.sapiens DNA for ZNF80-llnl<ed ERV9 long terminal repeat 1 


au66c07,x1 Schneider fetal brain 00004 Homo sapiens cDNA clone IMAGE:2781228 3' similar to contains 1 
element TAR1 repetitive element ; | 


|AV762869 MDS Homo sapiens cDNA ctone MDSEIC12 5' 1 


IHomo sapiens hypotiietical protein (LOG57143), mRNA 1 


1 Human mRNA for KIAA0184 gene, partial cds | 


Homo sapiens ryanodine receptor 3 (RYR3) mRNA | 


Top Hit 
Database 


|EST HUMAN | 


EST HUMAN 






1EST_HUMAN j 


|EST HUMAN 






1 


1 




[EST HUMAN 1 












'EST HUMAN | 








lEST HUMAN 1 


i, 


'EST HUMAN | 




1 


iEST HUMAN 1 








TopHitAcession 
No. 


|BE907186.1 1 


AH 24874.1 


IAF134838.1 | 


|AF1 34838.1 | 


|BF307745.1 1 


i 

1 


IM65134.1 1 




1 


IAW957182.1 1 


1 


IAJ238093.1 1 


IBE466132.1 1 


1 


1 47580811 


1 

E 




i 


IAI1410S3.1 1 


1 


1X63392.1 1 


t 45073781 


i 
i 




i 
1 


1 


AW162304.1 1 


|AV762869.1 1 


114349081 


1 


g 

g 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 2.0E-58I 


2.0E-58 


1 2.0E-58I 


g 




1 2.0E-58I 


g 




? 

o 


g 


g 


I.OE-58 


g 


g 


|8S-30-| 


I.OE-581 


g 


I.OE-581 


g 


g 


l.0E-58| 


1 8.0E-59I 


s 


i 


s 


s 


s 


S 


S 


V 


4.0E-59I 




























II 

r 














1 












i 


1 


i 


i 




? 






t 27.471 






<s 


«. 












ORFSEQ 
ID NO: 


1 


25731 








1 284591 


20463 1 




210671 


i 


21136| 




1 


232151 




1 


1 


245531 






21971 1 






228071 


242371 


2485o| 






S 


1 


Exon 
SEQ ID 
NO: 


1 19441 










1 


i 


1 


11212 








i 


1 


1 








§ 




1 


1 




S1.0S1 










10706 






























































Probe 
SEQID 
NO: 


1 




1 6246 1 






8332 






g 


s 


i 








3493 


1 




4627 


1 




1 


1 2182| 


i 




3088 






i 


1 82781 




1 
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1 op r-iit uescnptor 






1 Homo sapiens mRNA for KIAA1081 protein, partal cds | 


IHuman mRNA for integrin alplia-2 subunit 1 


[Homo sapiens S-antigen; retina and pineal gland (arrestin) (SAG), mRNA 1 


1 Homo sapiens KIAA0433 protein (KIAA0433), mRNA | 


Homo sapiens KIAA0433 protein (KIAA0433), mRNA | 


Homo sapiens cliromosome 21 segment HS21 C004 | 


1 Homo sapiens ctiromosome 21 segment HS21 C004 1 


Homo sapiens MHC class 1 region | 


Homo sapiens MHC class 1 region | 


! Homo sapiens interieul<in 1 0 receptor, beta (IL1 ORB), mRNA ] 


|Homo sapiens cullin 4A (CUL4A) mRNA, complete cds | 


Homo sapiens orniUiine decarboxylase 1 (0DC1 ) mRNA | 


|yr12f04n Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:205087 5' similar to contains ( 
LTR5 repetitive element ; 


Iyr12f04.r1 Soares fefal liver spleen 1NFLS Homo sapiens oDNA clone IMAGE:205087 5' similar to contains 1 
LTR5 repetitive element ; | 


|yq78h09.r1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:201953 5' similar to contains 1 
OFR repetitive element; 


iWf52o07.x1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone IMAGE5359212 3' ] 


i 

z 

1 

t 

i 

CO 

t 


UI-HF-BNO-akt-g-07-O-Ul.rl NIH MGC 50 Homo sapiens cDNA clone IMAGE:3078348 5' | 


UI-HF-BNO-akt-g-07-0-Ul.r1 NIH MGC 50 Homo sapiens cDNA clone IMAGE:3078348 5' 


EST1 1 498 Uterus Homo sapiens cDNA S end similar to similar to retrovirus-related pol | 


601 336446F1 N1H_MGC_44 Homo sapiens cDNA clone IMAGE:3590395 5' ] 


1601 336446F1 NIH MGC 44 Homo sapiens cDNA clone IMAGE:3690395 5' ] 


Homo sapiens prohibitin (PHB) mRNA ] 


Homo sapiens Xq pseudoautosomal region; segment 1/2 j 


1 

"3 

•1 

1 
I 

S 
1 


|ol60h11.y5 NCI CGAP Kid3 Homo sapiens cDNA clone IWAGE:1 534053 5' similar to SW:UDP l«OUSE 1 
IP52624 URIDINE PHOSPHORYLASE ; 






Top Hit 
Database 
Source 


|NT 1 


INT 1 


iNT 1 


iNT 1 


NT 


iNT 1 


INT 1 


iNT 1 


NT 


iNT 1 


1 INl 


lNT_ 1 


z 


iNT 1 


EST_HUMAN 


EST HUMAN | 


EST HUMAN 


lEST HUMAN j 


|EST HUMAN j 


|EST HUMAN j 


lEST HUMAN j 


|EST_HUMAN ] 


[EST HUMAN ] 


[EST HUMAN | 


NT _ 


NT 


iEST HUMAN | 


EST HUMAN 1 


INT ] 


INT ] 


§ 








IAB029004.1 1 


1X17033.1 1 




1 114171181 




|AL1 63204.2 | 


|AL1 63204.2 | 


|AF055066.1 j 


< 


1 45046341 


|AF077188.1 1 


1 45054881 


|h58041.1 1 


|h58041.1 1 


H52456.1 


IAI807917.1 1 


IAI807917.1 ] 


1AW503208.1 1 


IAW503208.1 1 


|AA299037.1 ] 


IBE562611.1 1 


IBE562S11.1 1 


8 


|AJ271 735.1 1 


IAW836196.1 ] 


AI792814.1 1 


1 51746441 




Most Similar 
(Too) Hit 


BLAST E 
Value 


s 


S 


1 8.0E-60I 


1 8.0E-601 


1 8.0E-60I 


1 8.0E-60I 


1 8.0E-60| 


1 s.oE-eoj 


1 8.0E-60I 


S 


1 7.0E-60I 


1 7.0E-60| 


1 7.0E-60| 


s 


1 7.0E-6o| 


1 7.0E-6o| 


e.oE-eo 


L s.oE-eoj 


S 


s 


? 


1 4.0E.60 


1 3.0E^0| 


I 3.0E-60i 


1 3.0E-60I 


1 3.0E-eo| 


3.0E-60] 


3.0E-60 


3.0E-60] 


; 




Signal 


cvi 


cvi 






t 2.26| 








5.38| 


§. 










3.28 










0.931 


0.93| 




4.851 








2.06! 








ORF SEQ 


ID NO: 


i 


i 


1 256451 




1 


s 


1 


1 283441 




20504] 


1 20504] 


1 20567| 


i 


] 23765] 


27488 


1 28843I 






1 


21979] 


i 




215911 


21592] 






1 254431 


1 24875! 


§ 




Exon 
SEQ ID 
NO: 




12013| 






u 


1 


1 




s 


i 


§ 




11971 


1 


17279 






10062 


§ 




12075 


i 








14250 


15383 


15111 


16775 
































































Probe 
SEQ ID 
NO: 


1 2125! 


1 2125 


56401 


i 


1 7174| 


7451 


7451 


8207 


1 8207] 




738 




1 20811 


§ 


1 7412 


i 


i 


S 




s 


1 2188] 


2942 


1815 




1 1826] 


9 


s 


6153 


3 


i 



348 
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Top Hit Descriptor 


Homo sapiens PXR2b protein (PXR2b), mRNA | 


Homo sapiens PXR2b protein (PXR2b), mRNA | 


Homo sapiens PXR2b protein (PXR2b), mRIMA | 


Homo sapiens PXR2b protein (PXR2b), mRNA | 


601300938F1 N1H_MGC_21 Homo sapiens oDNA clone IMAGE:3S35480 5' | 


i 

1 
< 

1 

1 
I 

i' 

1 

§ 


Homo sapiens PRa2014 mRNA. complete cds | 


601109238F1 NIH_MGC_16 Homo sapiens cDNA clone IMAGE:3350145 5' | 


i 

i 

I 

i 
1 
\ 

s 

o 


AUl306a9 NT2RP3 Homo sapiens cDNA clone NT2RP3001263 5' | 


Ig-beta/B29=CD79b {alternafively spliced} [human, B cells, mRNA Partial, 375 nt] | 


Human autosomal dominant polycystic kidney disease protein 1 (PKD1) gene | 


Homo sapiens general transcription factor 2-1 (GTF2I) mRNA, complete cds | 


601300938F1 NIH MGC 21 Homo sapiens cDNA clone IMAGE:3635480 5' | 


u .^J..„>u.^o.nnT^ 


Homo sapiens mRNA for K1AA0825 protein, partial cds | 


Homo sapiens amyloid beta (A4) precursor protein (protease nexln-ll. Alziieimer disease) (APP), mRNA 


AV731140 HTFHomosapienscDNAcloneHTFARB01 5' 1 


601309785F1 NIH_MGC_44 Homo sapiens cDNA clone IMAGE:3S31220 5' | 


Homo sapiens hypoBnetical protein FLJ11026 (FLJ11026), mRNA | 


QV3-HT0513-060400-147-d01 HT0513 Homo sapiens cDNA | 


QV3-HT051 3-060400-1 47-d01 HT051 3 Homo sapiens cDNA 1 


yv53d1 1 .s1 Scares fetel liver spleen 1 N FLS Homo sapiens cDNA clone IMAGE;246453 3" similar to 1 
gb:L25444 605 [RIBOSOMAL PROTEIN L35A (HUMAN); | 


Homo sapiens calmegin (CLGN), mRNA | 


yv03f11.r1 Scares melanocyte 2NbHM Homo sapiens cDNA clone IMAGE:270189 5' | 


Homo sapiens ATPase, H+ transporting, lysosomal (vacuolar proton pump) non-catalytic accessory protein 1 
1A(110/116kD)(ATP6N1A),mRNA | 


AV69431 7 GKC Homo sapiens cDNA clone GKCELG06 5' | 


Ul-H F-BNO-akd-f-1 2-0-Ul.r1 NIH MGC 50 Homo sapiens cDNA done IMAGE:3076774 5' | 


Homo sapiens polymerase (RNA) III (DNA directed) (39kD) (RPC39), mRNA | 


Top Hit 
Dataljase 
Source 


z 


NT 1 


NT 1 


NT 1 


5 
Z> 

^1 


EST HUMAN J 


NT 1 


[3 


EST HUMAN | 


EST HUMAN 1 


NT 1 


NT 1 


NT J 


EST HUMAN | 


NT 


NT 


NT 


EST HUMAN 1 


EST_HUMAN 1 


NT 1 


EST HUMAN | 


EST HUMAN 1 


[est human 1 


NT 


z 

1 


z 


EST HUMAN 1 


EST HUMAN 1 


1 


TopHitAcession 
No. 


7706670] 




1 




IBE409310.1 1 


iBE409310.1 1 


|AF11 9860.1 1 


:BE257400.1 | 


1 


:AU 130639.1 1 


IS79249.1 1 


i 

i 


IAF035737.1 1 


IBE409310.1 1 




Is 


S 

i 


AV731 140.1 


IBE396279.1 1 




1 




|n53039.1 1 


47580031 


IN39397.1 1 


E 
1 


1 


IAW500256.1 1 


s 

i 


Most Similar 
(Top) Hit 
BLAST E 
Value 


7.0E-61 


7.0E-61 


S 


7.CE-61 


6.0E-61 


6.0E-61 


6.CE-61 


6.0E-61 


6.0E-61 


s 


6.0E-61 


S 


6.CE-61 


6.0E-61 


5.0E-61 


S.CE-61 


5.CE-61 


4.0E-61 




2.0E-61 


2.0E-61 


2.0E-61 


1 2.0E-61 


s 


s 


1 2.0E-6I 


2.0E-61 


2.0E-61 


s 


Expression 
Signal 


i 


i 




1 






d 


0.95| 








s 












2.76 


















1.33 


1,62| 




ORF SEQ 
ID NO: 


1 


1 


1 


1 


1 


205641 


§ 


213761 


1 


i 




263701 






214211 


22835 








^ 


§ 


20951 


21407I 


1 






272791 


277831 


27979] 


Exon 
SEQ ID 
NO: 


1 


0098 


i 




1 


0723 


i 




1532 


i 


5583 








1558 


11 


is 


? i 




0433 






S 


i 


2472i 




1 




7735 




























































Probe 
SEQID 
NO; 














1 


1612| 


1 


i 


5674 


6344 


1 


9417 












1 






1 


2109 


1 




72121 


77071 


7885| 
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Top Hit Descriptor 


NUCLEOLAR TRANSCRIPTION FACTOR 1 (UPSTREAM BINDING FACTOR 1) (UBF-1) 
(AUTOANTIGEN NOR-90) 


qg56a04.x1 Scares testis NHT Homo sapiens cDNA clone 1MAGE:18391 50 3' similar to TR:O15103 1 
015103 HYPOTHETICAL 27.3 KD PROTEIN. ; | 


Human zinc finger protein ZNF131 mRNA. partial cds 


Homo sapiens CGI-5S protein (CGl-56), mRNA 1 


Wi04d02.x1 NC1_CGAP_CLL1 Homo sapiens cDNA done IMAGE:2389251 3' 1 


Wi04d02j(1 NCi CGAP CLL1 Homo sapiens cDNA clone IMAGE:2389251 3' | 


Homo sapiens CGI-18 protein (LOC51008), mRNA | 


MR3-STQ203-130100-025-a09ST0203 Homo sapiens cDNA 1 


wx51 e07.x1 NCI_GGAP_Lu28 Homo sapiens cDNA clone IMAGE:2547204 3' similar to SW:GG95_HUMAN 1 
Q08379 GOLGiN-95. icontains element MER22 repetitive element ; | 


Homo sapiens Xq pseudoautosomal region; segment 1/2 | 


Homo sapiens Xq pseudoautosomal region; segment 1/2 | 


Homo sapiens ryanodine receptor 3 (RYR3) mRNA | 


P47245 NARDILYSIN ; 


fh07g09.x1 NIH MGC 17 Homo sapiens cDNA clone IMAGE:2951616 5' | 


Homo sapiens muscle specific gene (M9), mRNA | 


Homo sapiens muscle specific gene (M9), mRNA | 


au71d03.y1 Schneider fetal brain 00004 Homo sapiens cDNA clone IMAGE:2781701 5' similar to gb:M37104| 
ATP SYNTHASE COUPLING FACTOR 6, MITOCHONDRIAL PRECURSOR (HUMAN); 


au71 d03.y1 Schneider fetai brain 00004 Homo sapiens cDNA clone IMAGE:2781 701 5' similar to gb:M371 04 1 
ATP SYNTHASE COUPLING FACTOR 6, MITOCHONDRIAL PRECURSOR (HUMAN); 


au71d03,y1 Schneider fetal brain 00004 Homo sapiens cDNA clone IMAGE:2781701 5' similar to gb:M37104(: 
ATP SYNTHASE COUPLING FACTOR 6, MITOCHONDRIAL PRECURSOR (HUMAN); f 


au71d03.y1 Schneider fetal brain 00004 Homo sapiens cDNA clone IMAGE:2781701 5' similar to gb:M37104| 
ATP SYNTHASE COUPLING FACTOR 6, MITOCHONDRIAL PRECURSOR (HUMAN); | 


EST1 82043 Jurkat T-cells V Homo sapiens cDNA 5' end | 


Wf12b08.x1 Scares NFL T GBC 31 Homo sapiens cDNA clone IMAGE:2350359 3' similar to 1 
gb9<57138_ma1 HISTONE H2B.2 (HUMAN); 


Wf12b08.x1 Scares NFL T GBC SI Homo sapiens cDNA clone IMAGE:2350359 3' similar to 1 
gb:X57138 rnal HiSTONE H2B.2 (HUMAN); | 


Homo sapiens keratin 18 (KRT18) mRNA | 


Top Hit 


Source 


SWISSPROT 


EST HUMAN 


NT 


NT 1 


EST HUMAN 1 


z 

5 


NT 1 


EST HUMAN 1 


EST HUMAN | 


NT ] 


NT ] 


NT ] 


EST_HUMAN 


EST HUMAN ] 


NT 1 


NT 1 


Z 

1 
D 


EST HUMAN 


EST HUMAN 




EST HUMAN | 


EST_HUMAN 


EST HUMAN | 


NT 1 


TopHitAcession 




P17480 






14182551 








AW814393.1 ] 


AI950528.1 1 


AJ271735.1 ] 


AJ271735.1 ] 


4506758] 


AA431 093.1 


AW410687.1 ] 


11425574] 


11425574] 


AW 161479.1 1 


i 
< 


AW 161 479.1 1 


AW 161 479.1 1 


AA31 1281.1 ] 


AI827900.1 


AI827900.1 1 




i 


U0941C,- 




A176280- 


A176280- 




Most Similar 
(Top) Hit 


BUSTE 
value 


7.0E-62, 


7.CE-62 


6.0E-62I 


6.0E-62I 


6.0E-62I 


3 


3 


3 


5.0E-62 


5.0E.62I 


5.0E-62I 


5.0E-62I 


5.0E-62 


5.0E.62] 


5.0E-62I 


5.0E-62] 


4.OE-62I 


4.0E-62 


4.OE-62I 


4.OE-62I 


3 


4.0E-62^ 


4.OE-62I 


4.0E-62] 


1 


Signal 








3.93 1 












3.26] 


3.26] 








4.91] 


4.91] 




3.47 








^ 






ORFSEQ 


d 

Q 


23I84I 








26525] 


26526 


i 


i 


2OI83I 






8 


i 


^ 


28723] 


28724 






s 






22183 1 






ili' 


13378 


18546 


1 


13258| 


16355] 


16355] 


i 




103561 


12236] 


12236] 


i 


14131 




18455] 


18455 


g 


107501 


1075o| 


1075o| 


1 


Si 


E 

Si 


13273| 


Probe 
SEQID 
NO: 


3462 


1 


i 


3338| 


1 


1 


§ 






8 




3372] 


1 


7482 


85871 


8587] 










! 


2409] 




3353| 
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Top Hit Descriptor 


|Homo sapiens partial 5-HT4 receptor gene, exons 2 io 5 | 


mRNA 


Homo sapiens ubiquitin specific protease 9, X chromosome (Drosophila fat facets related) (USP9X), mRNA | 




Homo sapiens eukari^f c translation initiation factor 2B, subunit 2 (beta, 391^) (E1F2B2), mRNA 


IHomo sapiens eul^aryolio translation initiation factor 2B, subunit 2 (beta, 39kD) (EIF2B2), mRNA 


[Homo sapiens mRNA for KIAA1253 protein, partial cds | 




IH.sapiens flow-sorted chromosome B Hindlll fragment, SC6pA16D3 




|Homo sapiens non-histone chromosome protein 2 (S. cerevisiae)-like 1 (NHP2L1), mRNA ( 


IHomo sapiens cadherin EGF LAG seven-pass G-type receptor 1 (CELSR1 ), mRNA 


|Homo sapiens calcineurin binding protein 1 {KIAA0330), mRNA j 


IHomo sapiens calcineurin binding protein 1 (KIAA0330), mRNA | 


Homo sapiens low density lipoprotein-related protein 2 (LRP2), mRNA | 




Homo sapiens mRNA for KIAA1 476 protein, partial cds 1 


Homo sapiens mRNA for KIAA1476 protein, partial cds | 


wa33f04,x1 NCI CGAP Kidi 1 Homo sapiens cDNA clone IMAGE:2299903 3' similar to contains THR.t2 . 


THR repetitive element; |- 


RC0-BN0284-30C600-031 -e05 BN0284 Homo sapiens cDNA |. 


RC0-BN0284-30C500-C31 -e05 BN0284 Homo sapiens cDNA 1' 


(U BE2D3) genes, complete cds V 


z 

I 
t 

1 

i 

! 

0 


Homo sapiens interseotin 2 (SH3D1B) mRNA, complete cds | 


Homo sapiens ADP/ATP carrier protein (ANT-2) gene, complete cds | ; 


afZOelLrl Scares NhHMPu SI Homo sapiens cDNA clone IMAGE:1047404 5' similar to WP:K01H12.1 1 
CE034S3 ; |, 


DKFZp566F104 r1 566 (synonym: hfkd2) Homo sapiens cDNA clone DKFZp566F104 5' | , 


Top Hit 
Database 
Source 


















1- 












z 










EST HUMAN 
NT 1 


EST HUMAN | 


EST HUMAN 1 




EST HUMAN 1 




z 


EST HUMAN | 


EST HUMAN | 




TopHitAcession 
No. 


IAJ243213.1 1 


« 

s 






§ 


1 


|AB033089.1 | 


i 


1 


1 




18322 


17862 


17862 


11 430460 1 




i 


i 
§ 

1 


X52858.1 1 


AI632733.1 

AL1 63284.2 | 


BF329911.1 1 


i 

1 


i 


BF330676.1 1 


AF248540.1 | 


L78810.1 1 


AA625207.1 | 


AL039044.1 ] 












Lost Similar 


(Top) Hit 
BUSTE 
Value 


1 4.0E-62 


1 4.0E-62I 


1 4.0E-62I 


1 4.0E-62I 


4.0E-62I 


1 4.0E-62I 


4.0X6^ 




4.0E-62| 


4.0E-62I 


4.0E.62I 




4.0E-62I 


4.0E-62I 


4.0E-62I 


3.0E-62I 


^ 




S 


3.0E-62 
2.OE-62I 


2.0E-62I 


2.0E-62| 


2.0E-62 


2.0E-62I 


1.0E-62I 


1.0E-62I 


? 


1.0E-62] 


.i _ 

1* 










f3 


Pi 




































i 






q 




ORFSEQ 
ID NO: 




1 


1 25822 




S 


s 




1 




j 28792] 




1 262011 










227281 


22729] 


23340 


27038 
20969] 


S8 








20791] 








Exon 
SEQ ID 
NO: 


14824 




15709 




16357 


i 






18254 


i 


19578 




19302 




19335 


§ 




1 


3554 




f2 






8744 


§ 




1657 




























































Probe 


SEQ ID 
NO: 


1 4947 


i 


1 58041 


i 


i 


i 


7134 




8377 


9135 




9743 


9792 


9/92 


i 




3008 




i 


§ 

S ?! 


1 














i 
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I 



Top Hit Descriptor 






1 

1 

1 
< 

Q 

1 
t 

1 

i 
1 
l' 

i 

i 


[Homo sapiens KIAA0763 gene product (KIAA0763), mRNA | 


1 Homo sapiens K1AA0763 gene product (Kl AA0763), mRNA | 


1 H.sapiens lysosomal acid pliosphafase gene (EC 3.1.3.2) Exon 9 ) 


IH.sapiens lysosomal acid phosphatase gene (EC 3.1.3.2) Exon 9 1 


|aa33d08.s1 NCLCGAP_GCB1 Homo sapiens cDNA clone IMAGE:815055 3' | 


IH.sapiens flow-sorted chromosome 6 Hindlll fragment, SC6pA1 4D8 1 


1 Homo sapiens cadherin EGF LAG seven-pass G-type receptor 1 (CELSR1 ), mRNA | 


|Homo sapiens low density lipoprotein-related protein 2 (LRP2), mRNA | 


|QV4-ST0234-1 81 199-037-f05 ST0234 Homo sapiens cDNA | 


1018159 Human placenta cDNA (TFujiwara) Homo sapiens cDNA clone GEN-558C10 5' 1 


Homo sapiens mRNA for KIAA0350 protRin, partial cds | 


jHomo sapiens mRNA for KIAA0350 protein, partial cds ( 




iHomo sapiens mRNA for PkBWnase | 


Homo sapiens nucleoporin 88kD (NUP88), mRNA | 


[Homo sapiens Ras association (RalGDS/AF-6) domain family 2 (RASSF2), mRNA | 




Homo sapiens IL2-inducible T-cell kinase (ITK), mRNA | 


iGaBus gallus Dach2 protein (Dach2) mRNA, complete cds | 


IGallus gallus Daoh2 protein (Dach2) mRNA, complete cds 1 


§ 

8 

z 

1 
! 

1 

1 

5. 

§ 


|wm66g1 1 j(1 NCI_CGAP_Ut2 Homo sapiens cDNA clone IMAGE:2439908 3' 1 


Incesrazrl NCI CGAP Pri Homo sapiens cDNA clone 1MAGE;745947 similar to gb:Y00361 60S 1 
RIBOSOMAL PROTEIN (HUMAN); | 


IHomo sapiens chromosome 21 segment HS21C078 | 


CM3-BT0595-1 90100-072-a09 BT0595 Homo sapiens oDNA | 


;CM3-BT0595-190100-072-a09 BT0595 Homo sapiens oDNA | 


1 
0 

s 

i 
0 

z 

■g 
■l 
i 

X 

2 
1 
5 

1 
1 


i 
1 

i 
0 

1 

f 
1 

1 
5 

i 
9 
I 
■f 

i 


Homo sapiens mRNA for KIAA071 7 protein, partial cds | 


Top Hit 
Database 
Source 


z 


[EST HUMAN | 


|EST HUMAN | 










[EST HUMAN | 






z 


lEST HUMAN | 


lEST HUMAN 1 


z 


z 


z 




z 














iEST HUMAN 1 


EST HUMAN | 




EST HUMAN | 


lEST HUMAN 1 


EST HUMAN 1 


EST HUMAN | 


z 


Top Hit Acession 
No. 


i 
1 












1X15533.1 1 


i 

i 


i 


S 


I 

i 


|AW 81 6405.1 1 


1018159.1 1 


1AB002348.2 | 


|AB002348.2 | 


114181851 


iY1 5056.1 1 


i 


§ 


4557734I 


i 


i 

< 


AF198349.1 | 


AL1 53268.2 | 


AI872137.1 1 


1 


AL1 63278.2 | 


AW750372.1 | 


AW 750372.1 | 


AW 134709.1 1 


AW134709.1 1 


1 
< 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 .OE-62 1 


s 


s 


1 .OE-62 1 
















s 


s 


9.0E-63| 


9.0E-e3l 


9.0E-63I 


9.0E-63I 


s 


9.0E-63I 


8.0E-63 


8.0E-63I 


2 


8.0E-e3| 


8.0E-e3| 


7.0E-e3| 




? 




4.0E-63I 


4.0E-63I 


4.0E-63I 


f 






































Expression 
Signal 




















































1 


1 












ORF SEQ 
ID NO: 


1 241 08 1 


1 




1 27310 1 


1 27311 1 




1 273321 


1 


s 






\ 20109] 






1 




25104| 


26281 




22077I 


22IO2I 


231341 


23135| 












1 


286111 




1 


Exon 
SEQ ID 
NO: 


14321 












17139 


17300 


1 


19199 


19322 


i 


12179 


1 


§ 




15274 


16127 


i 


















15785 




18347 


18347 


1 


































































Probe 
SEQ ID 
NO: 


1 


1 


i 


7239 


7239 


1 


7262 


7512 


i 


i 


9815 




1 22971 


i 


3955 


5210 






i 


i 


2322 


3415 


5 


g 






3279 


S 


5879 


1 84741 


8474 


i 



354 



wo 01/57274 PCT/lIS01/006fi6 
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Top Hit Descriptor 


|Human l(3)mbt protein iiomolog mRNA, complete cds | 


iHcmo sapiens KIAA0618 gene product (KIAA061 8), mRNA | 


|HomosaplensKIAA0618geneproduct(KIAA0618), mRNA ( 


'Homo sapiens putative transcription factor CR53 (CR53) mRNA, partial cds 1 


O 
1 
"1 

in 

8 


|RC3-ST0197-120200-015-a03 ST0197 Homo sapiens cDNA 1 


CI 8895 Human placenta cDNA (TFujiwara) Homo sapiens cDNA clone GEN-569E02 5' | 


I60158955SF1 NIH_MGC_7 Homo sapiens cDNA clone 1MAGE:3943577 5' 1 


AV71 1 71 4 DCA Homo sapiens cDNA clone DCAAMC01 5' | 


AV711714 DCA Homo sapiens cDNA clone DCAAMC01 5' ] 


IH. sapiens isoform 1 gene for L-type calcium channel, exon 28 1 


RC5-FN0019-290600-011-G11 FN001 9 Homo sapiens cDN A | 


|Homo sapiens golgi matrix protein GM130 (G0LGA2) mRNA, complete cds | 


Homo sapiens golgi matrix protein GM130 (GOLGA2) mRNA, complete cds 1 


bb72h12.y1 NIH_MGCJ 2 Homo sapiens cDNA clone IMAGE;3047975 5' similar to gb:L08069 DNAJ 1 

'PROTEIN HOMOLOG 2 (HUMAN); 


PROTEIN HOMOLOG 2 (HUMAN); 


IC046 


IC046 1 


IC046 1 


IC027 1 


i 

1 
1 

.1 
1 

i 
s 

1 
i 


Homo sapiens elF4E-like cap-binding protein (4EHP) mRNA 1 


|Wo87b0lJCl NCI_CGAP_Kid11 Homo sapiens cDNA clone IMAGE:2462281 3' similar to contains element 1 
L1 repetitive element; | 


Homo sapiens chromosome 21 segment HS21 C046 | 






EST37021 5 MAGE resequences, MAGE Homo sapiens cDNA 1 


EST37021 5 MAGE resequences, MAGE Homo sapiens cDNA ( 


AU 1 24387 NT2RM2 Homo sapiens cDNA clone NT2RM20021 13 5' | 


Homo sapiens chromosome 21 segment HS2' 


Homo sapiens chromosome 21 segment HS2' 


Homo sapiens chromosome 21 segment HS2' 


Homo sapiens chromosome 21 segment HS2' 


Top Hit 


Source 


iNT 1 


iNT 1 


z 


'NT 


iEST HUMAN | 


lEST HUMAN 1 


EST HUMAN | 


IEST HUMAN 1 


EST HUMAN 1 


iEST HUMAN ] 


'NT 1 


EST HUMAN | 




NT 1 


X 


EST HUMAN 


NT 




NT 1 


NT 1 


EST HUMAN | 


NT 


EST HUMAN | 


NT 1 


NT 1 


NT 1 


EST HUMAN ] 


EST.HUMAN 


EST_HUMAN ] 


TopHitAcession 




i 


1 


i 


lAFOI 7433.1 1 


i 
i 


|AW81 3783.1 1 


i 


]BE794381.1 j 


[AV711714.1 1 


]AV711714.1 ] 


|Z2e273.1 1 


§ 


|AF248953.1 ] 


|AF248953.1 ] 


BE206521.1 


BE206521.1 


AL1 63246.2 1 


IAL163246.2 | 


|AL1 63246.2 j 


|AL1 63227.2 | 


1 




AI927030.1 1 


;AL1 63246.2 ] 


IALI 63245.2 1 


4504058] 


IAW958145.1 1 


AW958145.1 1 


:AU124387.1 ] 


Most Similar 
(Top) Hit 


BLAST E 
Value 


j 5.0E-64I 


5.0E^ 




S 


I 


4.0E-S4I 


3.0E-64I 


3.0E-64I 


3.0E-64] 


3.0E-64] 


i 

Pi 


3.0E-64] 


3.0E-64] 


1 


3.0E-64 


3.0E-64 


3.0E-64| 


3.OE-64I 


3.0E-64] 


3.0E-Q4] 


s 




2.OE-64I 


s 


2.0E-64| 


2.OE-64I 


2.0E-64I 


2.0E-64I 


2.0E-64| 


Expression 


Signal 


1 1-671 












] 3.851 


] 0.95 1 




1 1.51] 




















i 4.591 


s 








1 3.03 1 


2.56| 


1 1.33] 






ORFSEQ 


ID NO: 


1 21457i 


1 212351 


1 2123^ 


1 23575| 


\ 28324] 


1 28325] 


j 219361 


] 22943] 


23112] 


1 23113] 


I 25713] 




27005 


27006] 


27015 


2701 7I 


274971 


?g 




29040] 


208311 






1 


1 22253] 


S3 


23416 


! 23417] 


i 


Exon 
SEQ ID 
NO: 


11585 


R 


113701 




§ 


18074 


12039 


13140 


13313 


13313 


15611 




i 


i 




i 


is 




i 




i 




12354 


i 






1 


13631 






























































Probe 
SEQID 
NO; 


1 


i 


1 2796| 


1 


1 


1 


2151 


3216 


1 




5703 


5911 


i 






1 






i 




1072 


1376 




1 2484] 


2484 


1 3101 1 


1 3719] 


1 3719] 
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358 



wo 01/572" t 
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11 



g 



359 
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11 
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Hi 



? 9 g 
> l5 a 



0 i 2 1 



: X o o 



I -si 



s_ 

ri 
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Top Hit Descriptor 


O 

i 
1 

a 
1 

1 
o 

o 

i 

i 

5 

3 


iHomo sapiens solute carrier family 25 (mitochondrial can-ier; adenine nucleotide translocator), member 5 1 
|(SLC25A5), nuclear gene encoding mitochondrial protein, mRNA | 


(SLC25A5), nuclear gene encoding mitochondrial protein, mRNA 


|yz27g12.r1 Scares multiple sclerosis 2NbHMSP Homo sapiens cDNA clone 1MAGE:284326 5' similar to 1 
|SW:H2B1 TIGCA P350S8 HISTONE H2B.1/H2B.2. [2] PIR:B56612 ; | 


yz27g12.r1 Soares_multiple_sclerosis_2NbHMSP Homo sapiens cDNA clone IMAGE:284326 5' similar to 1 
SW:H2B1 TIGCA P35068 HISTONE H2B.1/H2B.2. [2] PIR:B56612 ; 


yz27g12.r1 Soares_multiple_sclerosis_2NbHMSP Homo sapiens cDNA clone IMAGE:284326 5" similar to 1 
SW:H2B1_T1GCA P35068 HISTONE H2B.1/H2B.2, [2] PIR:B56612 ; | 




[Homo sapiens K1AA0649 gene product (KIAA0649), mRNA 1 


j Homo sapiens NIPSNAP, C elegans, homolog 1 (NIPSNAP1 ), mRNA 1 


|Homo sapiens NIPSNAP, C, elegans, homolog 1 (NIPSNAP1), mRNA | 




IHomo sapiens Missiiapen/NIK-related kinase (MINK), mRNA 1 


< 

E 

Z 

1 

1 
1 

i 
1 

1 

1 

E 


Homo sapiens origin recognition complex, subunit 5 (yeast homolog)-like (ORC5L) mRNA, and translated 1 
products 


Homo sapiens origin recognition complex, subunit 5 (yeast homolog)-like (ORC5L) mRNA, and translated 1 
products 1 


Homo sapiens chromosome 21 segment HS21 CI 01 | 




IHomo sapiens sodium/calcium exchanger isoform NaCa3 (NCX1) mRNA, ccxnplete cds 1 


Homo sapiens HLA-B gene for human leucocyte antigen B | 


Homo sapiens HLA-B gene for human leucocyte antigen B | 


to 

i 

i 

1 

i 

E 
1 

s 
1 

e 
1 


IHomo sapiens G-2 and S-phase expressed 1 (GTSE1), mRNA | 


AV71 781 7 DCB Homo sapiens cDNA clone DCBADC07 5' | 


IAV717817 DCB Homo sapiens cDNA clone DCBADC07 5' 


IAV717817 DCB Homo sapiens cDNA clone DCBADC07 5' 


IAV717817 DCB Homo sapiens cDNA clone DCBADC07 5' | 


Top Hit 
Source 




NT 




EST HUMAN 


EST HUMAN 


EST_HUMAN 




iNT 1 


iNT 1 


INT 1 


Lt 


NT 


NT 1 


NT 




NT 1 






NT 1 


INT 1 


EST HUMAN | 


INT 1 




EST_HUMAN 1 


EST HUMAN 1 


EST HUMAN | 


Top HitAcession 
No. 


IBF507493.1 | 






N55323.1 1 


N55323.1 


N55323.1 1 


111418801 


i 




1 


i 
1 


76573341 


76573341 


4505524I 


4505524I 


AL1633C1.2 1 


i 


AF108389.1 1 


AJ 133267.2 | 


AJ133267.2 1 


N45480.1 1 


i 


AV717817.1 1 


AV717817.1 1 


AV717817.1 1 


AV717817.1 1 


MostSimiiar 
(Top) Hit 
BLAST E 
Value 


s 


s 


3.0E-66 


I 






s 


3.0E-66I 


3.0E-66| 


3,0E-S5| 




2.0E-66I 


I 


2.0E-66I 


I 


I 


2.0E-65| 


I 


I 


I 


S 


2.0E-55I 


1.0E-66I 




1.0E-66I 


1.0E-55I 


1.1 

r 




i 




O.84I 










? 






1.02] 




s 




2.O2I 






12.691 








1.381 


« 


^ 


3.26| 


ORFSEQ 
ID NO: 






















i 


i 


s 






i 


§ 




1 








1 225851 






1 22586 1 




17943 






i 


i 






§ 


1 


1 


i 


[0033 




i 


s 






1 


1 






































































Probe 
SEQID 
NO: 


1 8052| 


1 


1 


s 


i 


8 


2675 


3079 


i 


i 




•9 








1784 


2944 


3975 


4556 


i 


7135 


9475 




i 
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Top Hit Descriptor 


6021 52996F1 NIH MGC_81 Homo sapiens cDNA clone 1MAGE:42941 51 5' | 


IRC5-BN0193-C1C9CO-034-G06 BN0193 Homo sapiens cDNA | 


i 
1 
Q 

1 
I 

O 

O 

1 


Homo sapiens jun dimerizafion protein gene, partial cds; cfos gene, compieie cos; ana unKnown gene | 


Homo sapiens Ran GTPase activating protein 1 (RANGAP1), mRNA | 


EST01750 Subtracted Hippocampus, Stratagene (cat. #936205) Homo sapiens cDNA clone HHCPN31 1 
similar to L1 repetitive element | 


su75d02.x1 Schneider fetal brain 00004 Homo sapiens cDNA clone IMAGE:2782083 3' similar togb:M37104| 
ATP SYNTHASE COUPLING FACTOR 6, MITOCHONDRIAL PRECURSOR (HUMAN); | 


EST9681 2 Testis 1 Homo sapiens cDNA 5' end similar to similar to C. elegans hypothetical protein, cosmid 1 
ZK353 1 


1 

i 
§ 
< 

a 
1 
1 
1 

z 

1 
& 

1 


i 

1 

a 
1 

E 

i 

8- 

1 

8i' 

S 






i 

s 

ll 

13 
< < 

ii 

so 

H- 
|i 

is 

Ii 

18 

Ii 
il 


Homo sapiens adaptor-related protein complex 2, beta 1 subunit (AP2B1 ), mRNA | 








Homo sapiens gene for AF-6, complete cds | 




H.sapiens mRNA for acetyi-CoA carboxylase | 


Homo sapiens mRNA for transmebrane receptor protein 1 




Homo sapiens Synapsin III (SYN3) mRNA, and translated products | 


Homo sapiens Synapsin III (SYN3) mRNA, and translated products | 


Homo sapiens chromosome 21 segment HS21 C001 | 


Homo sapiens chrcmosome 21 segment HS21 C001 | 


Homo sapiens DKFZp434P21 1 protein (DKFZP434P21 1 ). mRNA | 


1 

E 

1 

1 

& 
■1 

I 


Top Hit 
Database 


lEST HUMAN | 


|EST HUMAN 1 




NT 1 


1 INl 


EST HUMAN | 


EST HUMAN | 


EST HUMAN 


lEST HUMAN 1 


|EST HUMAN | 


iNT 1 


iNT 1 


EST HUMAN 


|NT 1 


INT 1 


iNT 1 


NT 


|NT 1 


iNT 1 


|NT 1 


iNT 1 


NT 


|NT 1 


iNT 1 


iNT 1 


|NT 1 


|NT 1 




Top HitAcession 
No. 


i 




i 
1 


AF11 1167.2 1 


114181771 


M78158.1 1 


AW162232.1 1 


1 


|W85947_1 1 


1 
1 


1 


7657243 1 


AW 162232.1 1 


1 


i 


1 


1 114304601 


i 
i 


1 114215271 


1 


Z17227.1 1 




45073321 


45073321 


IAL163201.2 1 


|AL163201.2 1 


76570201 


i 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 1.0E-66I 


1 1.0E-66| 


1 1.0E-66| 


1.0E-66 


S 


8.OE-67I 


7.OE-67I 


7.0E-67 


1 7.0E-67| 


1 7.0E.67I 


1 7.0E-67| 


s 


7.0E-67 


1 7.0E-67I 


1 7.0E-67I 




7.0E-67 


1 7.0E-67| 


1 7.0E-67I 


1 6.0E-67I 


1 6.0E-67j 


6.0E-67 


1 6.0E-67I 


1 6.0E-67I 


1 6.0E-67I 


S 


1 6.0E-67| 


S 

(d 


.1 _ 

i 1 

r 




! 








6 


« 


% 
























1.32 


















ORFSEQ 
ID NO: 


1 250311 






1 






201 84I 












1 20184 










1 




1 202971 


i 




1 231061 


1 231071 


1 237141 


1 237151 


1 242831 


i 












Exon 
SEQ ID 
NO: 


15227 


i 




i 


i 


14711 


§ 

s 


i 


1 


i 




i 




i 


i 


i 






19357 


i 


i = 




13308 


13308 




13938! 


1 


i 


























































Probe 
SEQID 
NO: 




1 61431 


8 


8312 


i 


1 




i 






i 




1 2779I 


1 5793 1 


1 




i 










3131 


i 


3391 


4035 


1 40351 




1 
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Top Hit Descriptor 




iHomo sapiens T cell receptor beta locus, TCRBV7S3A2 to TCRBV12S2 region 1 


Z 
Q 

.1 
1 

i 
1 
1 

1 
1 

1 


1 yn02d1 1 .r1 Scares adult brain N2b4HB55Y Homo sapiens cDNA clone IMAGE:1 67253 5' 


1 RC0-HT0934-1 50900-026-C03 HT0934 Homo sapiens cDNA | 


jnw06a01.s1 NCI GGAP SSI Homo sapiens cDNA clone IWAGE:1 238472 3' similar to TR:01 0385 010385 1 
PRO-POL-DUTPASE POLYPROTEIN ; 


1 EST37903 Embryo, 9 week Homo sapiens cDNA 5' end | 


|RC4-BT0311-141199-011-h0BBT0311 Homo sapiens cDNA 1 




|iir31f05.x1 NCI CGAP KIdll Homo sapiens cDNA clone IIUlAGE:3134913 3' similar to SW:RHOP MOUSEl 
Q61 085 GTP-RHO BINDING PROTEIN 1 ; | 


!on18b07.s1 Soares_NFI _T_GBC_S1 Homo sapiens cDNA clone IMAGE: 1541 365 3' | 


Ihw16g09.x1 NCI CGAP Lu24 Homo sapiens cDNA clone IMAGE:3183136 3' similar to WP:F23H1 1.9 1 
ICE09617; | 


QV4-ST0234-181199-037-f05 ST0234 Homo sapiens cDNA | 


Homo sapiens double stranded RNA activated protein l<inase (PKR) gene, exons 2a, 2, 3, and 4 1 


ba72g05.y1 NIH MGC 20 Homo sapiens cDNA clone IMAGE:2905976 5' similar to TR:094892 094892 ) 
KIAA0798 PROTEIN. ; | 


ba72g05.y1 NIH_MGC_20 Homo sapiens cDNA clone IMAGE:2905976 5' similar to TR:094892 094892 
'KIAA0798 PROTEIN. ; | 


Homo sapiens KRAB zinc finger protein ZFQR mRNA, complete cds 1 




1 

i 

1 
1 

! 
1 
t 
i 

d 
1 

I 

1 

1 


Homo sapiens chromosome 21 segment HS21 01 00 | 


801875351F1 NIH_MGC_55 Homo sapiens cDNA clone ll\>IAGE:4091 893 5' 1 


Homo sapiens mRNA for NADPH-cytochrome P-450 reductase, complete cds | 


Homo sapiens mRNA for NADPH-cytochrome P-450 reductase, complete cds | 


IRC4-BT0566-170100-011-C07 BT0566 Homo sapiens cDNA | 


RC4-BT0566-1 70100-01 1-c07 BT0566 Homo sapiens cDNA | 


Homo sapiens KIAA0985 protein (KIAA0985), mRNA | 


601175762F1 NIH_MGC_17 Homo sapiens cDNA clone IIVlAGE:3531038 5' | 


PM2-TN01 03-0409C0-001 -c02 TN0103 Homo sapiens cDNA 




Top Hrt 

Database 






lEST HUlvlAN 1 


lEST HUMAN 1 


|EST HUMAN | 


EST HUMAN 


|EST HUMAN | 


lEST HUMAN 1 


lEST HUMAN 1 


s 


2 


EST HUMAN 


[EST HUMAN | 




z 

i 

i 


EST HUMAN | 






lEST HUMAN 1 




z 

1 

I 






EST HUMAN | 


lEST HUMAN | 




z 

i 


[EST HUMAN 1 




Top HitAcession 
No. 


t 45078481 


i 

< 


|BE010038.1 1 




IBF357321.1 1 


AA714294.1 


i 
i 




i 

i 
1 




i 


i 


IAW816405.1 1 


j 


BE303037.1 | 


BE303037.1 | 


IAF309561.1 1 


47587951 


IAA625755.1 1 


iALI 63300.2 | 


i 


1AB051763.1 1 


1 


IAW602635.1 | 


IAW602635.1 | 


114364481 


IBE295714.1 | 


IBF377169.1 1 


i 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 6.0E-67I 


1 5.0E-67| 


1 5.0E-67| 


1 4.0E-67I 


S 


4.0E-67 


1 3.0E-67I 




1 3.0E-67| 


3.0E-67I 


s 




S 




2.OE-67I 


2.OE-67I 


1 2.0E-67I 


1 2.0E-67I 


1 2.0E-67I 


! 2.0E-67I 


S 


i 2.0E-67| 


s 


1 2.0E-67I 


1 2.0E-67I 


1 2.0E-67I 


S 


1 2.0E-67I 


1 2.0E-67| 


Expression 
Signal 








































^ 




















ORF SEQ 
ID NO: 




1 




§ 










1 242701 


26827 1 




1 


206041 






2I6I5I 




1 221531 


231441 




1 257341 




1 25821] 


1 27264| 








1 281441 


S 
g 


Exon 
SEQ ID 
NO: 


14761 


13109 








18290 


8 




14484 




i 




10754 


I 




11737 




12261 






15631 


15708 


15708 


g 


17079 


1 


s 


17900 






























































Probe 
SEQ ID 
NO: 


1 51251 


3184 


1 83521 




s 




2782 


1 34071 


1 


8 


i 






1 






1 23361 




1 34221 


8 


1 57241 


5803 






1 72021 


8409 


i 


8751 


i 
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Top Hit Descriptor 


nl45h10.s1 NCl_CGAP_Pr4 Homo sapiens cDNA clone IMAGE:1043683 similar to contains PTR5.t3 PTR5 
repetitive element ; | 


Homo sapiens chromosome 21 segment HS21 C006 


1 Human mRNA for KIAA0272 gene, partial cds 


1 Human mRNA for KIAA0272 gene, partial cds | 


iHomo sapiens short chain L-3-hydroyyacyl-CoA dehydrogenase precursor (HADHSC) gene, nuclear gene 1 


Homo sapiens short chain L-3-hydroyyacyl-CoA dehydrogenase precursor (HADHSC) gene, nuclear gene 


bbSlaOe.yl NIH_MGC_10 Homo sapiens cDNA clone IMAGE:3048754 5' similar to SW:R23B HUMAN 
P54727 UV EXCISION REPAIR PROTEIN PROTEIN RAD23 HOMOLOG B ; 


|ye43e09.r1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:120320 5' | 


loyl 5e03.s1 Soares_senescent_fibroblasts_NbHSF Homo sapiens cDNA clone IMAGE:1665916 3' similar to 1 


IHomo sapiens neuronal cell death-related protein (LOC51616), mRNA 1 


|Homo sapiens disab!ed-2 gene, exons 2 through 15 and complete cds | 


IHomo sapiens phosphatidylinositol 4-kinase230 (pl4K230) mRNA, complete cds 


|Homo sapiens PMS2L16 mRNA, partal cds | 


IHomo sapiens PMS2L1 6 mRNA, partial cds | 


Homo sapiens hairy/enhancer-of-split related with YRPW motif-like (HEYL), mRNA 




1 Homo sapiens SNARE protein kinase SNAK mRNA, complete cds | 


Homo sapiens SNARE protein kinaseSNAK mRNA, completecds 




p2_1 5 Human Epidermal Keratinocyte Subtraction Library- Upregulated Transcripts Homo sapiens cDNA 


Homo sapiens attractin precursor (ATRN) gene, exon 19 


IHuman mRNA for KIAA0045 gene, complete cds 




Homo sapiens CAGL79 mRNA, partial cds | 


Homo sapiens mysmesln (M-protein) 2 (1 55kD) (MYOM2), mRNA | 


Homo sapiens hypothetical protein FLJ10998 (FLJ10998), mRNA | 


Top Hit 
Database 
Source 


|eST HUMAN 




z 








EST HUMAN 


? 


EST HUMAN 


Z 


Z 


Z 


Z 


z 


z 


z 


z 




i 


EST HUMAN | 


z 


z 


z 


z 


z 




.3 

i 






























3571531 
















1 




♦25430 1 


1 


Top Hit Ac 
No. 


|aA557683 


|AL16320S. 


ID87462.1 


i 


|aF095703. 


EO/SBOdVl 


|BEai8477 


IT95489.1 


|aI077927.1 




|AF205890. 


|AF012872. 


|AB017007. 


|AB017007. 




i 


IAF246219. 


|AF24S219. 




BE1 22850. 


iAF218904. 


§ 




U80753.1 






Similar 
>p) Hit 
\STE 
alue 




2.0E-71 




2.0E-71 


2.0E-71 












i 


1 .OE-71 1 








I.OE-71 




1 .OE-71 1 


1 .OE-71 1 


1 .OE-71 1 






1 .OE-71 1 






























































§ 

1^ 






1 


i 




S 




^ 






































ORF SEQ 
ID NO: 




§ 


1 24957 i 


1 










20371 1 








1 


21822 


22416] 


23179| 






1 




i 


24050| 


1 






i 


Isi 












17872 


1 






g 










12528 


13373 




13462 










15947 




s 


























































Probe 
SEQID 
NO: 


8077 j 




S 


1 


1 8022 


1 8022 1 


§ 




s 




i 


1317 


2036 


2036 


1 


3457 




3546 




1 


i 


4370 


§ 




6750 


§ 



371 



wo 01/57274 PCT/lIS01/006fi6 



1 

& 


Homo sapiens hypothetical protein FLJ10998 (FLJ10998). mRNA | 




AV761217 MDS Homo sapiens cDNA clone MDSEIA03 5' 


AV761217 MDS Homo sapiens cDNA clone MDSEIA03 5' | 


Homo sapiens coagulation factor XIII, AI polypeptide (F1 3A1 ), mRNA | 


Homo sapiens leucyl/cystinyl aminopeptidase (LNPEP), mRNA | 


E 

1 

t 

.1 

S 
1- 
1 
i 
I 


Homo sapiens gene for AF-6, complete cds | 


wkE5g03.x1 NGI_CGAP_Lu19 Homo sapiens cDNA clone IMAGE:2423188 3' similar to TR:O86705 0867051 
HYP0THETICAL38.6KD PROTEIN. ;containsAlu repetitive element; 


wkgSgOS.xl NCI CGAP Lu19 Homo sapiens cDNA clone IMAGE:2423188 3' similar to TR:O86706 0867051 
HYPOTHETICAL 38.6 KD PROTEIN, jcontains Alu repetitive element; | 






Homo sapiens aconitase 2, mitochondrial (AG02), nuclear gene encoding mitocondrial protein, mRNA | 




8 
i 

E 

! 

1 
I 

i 
1 


7k53a05.x1 NCI_CGAP_GC6 Homo sapiens cDNA clone 1MAGE;3480080 3' similar to SW:KMLC_RABIT 1 
P07313 MYOSIN LIGHT CHAIN KINASE, SKELETAL MUSCLE ; 1 


QVO-CSOOIO-1 50900-398-611 CSOOlO Homo sapiens cDNA | 




QV0-CS001 0-1 50900-398-el 1 CS001 0 Homo sapiens cDNA j 




1 
1 

< 
E 

1 

i 
1 
1 


AU 1 28584 NT2RP2 Home sapiens cDNA clone NT2RP2003751 5' | 


au80o03.y1 Schneider fetal brain 00004 Homo sapiens cDNA clone IMAGE:2782564 5' similar to 
TR:Q99785 Q99785 HYPOTHETICAL 32.4 KD PROTEIN jcontains element MSR1 repetitive element ; 


MR4-BT0598-01 0600-005-d05 BT0598 Homo sapiens cDNA 


MR4-BT0598-01 0600-005-d05 BT0598 Homo sapiens cDNA 


QV1-BT0532-280800-342-a10 BT0632 Homo sapiens cDNA | 


Top Hit 
Database 
Source 






EST HUMAN 1 


EST_HUMAN | 


z 


z 






IeST HUMAN 1 


|eST HUMAN 1 


z 


Z 


z 


Z 


z 


Iest human 1 


EST HUMAN | 


EST_HUMAN ] 


EST HUMAN ] 


EST HUMAN ] 


z 


EST HUMAN ] 


EST HUMAN 


EST HUMAN 1 


EST HUMAN 1 


EST HUMAN ] 


Top Hit Acession 
No. 




AY007643.1 | 






1418903] 


5 




AB01 1399.1 1 


1 
1 


|aI857635.1 1 


1 
i 


4501866 


1 
1 


S41 694.1 1 


i 
5 


|bF059578.1 


BF333707.1 ] 


BF333707.1 ] 


BF333707.1 ] 


BF333707.1 ] 






AW161274.1 


BF331 571.1 ] 


BF331571.1 ] 




AV76121 


AV76121 








Most Similar 
(Top) Hit 
BLAST E 
Value 






















7.0E72 






7.0E-72! 












5.0E-72 




5.0E-72 


5.0E-72 






5.0E-72 


















Expression 
Signal 




1 






























i 


1 
















9 


ORF SEQ 
ID NO: 


i 


1 


















23700 


23701 




26244] 




1 


i 




1 






S 






28707 




Exon 
SEQ ID 
NO: 




1 


S 


§ 


18135| 


S 
S3 








10347! 


13926 


13926 






s 


1 


10043] 


i 


10043] 


S 


i 


?3 

8 




18437] 


18437 




Probe 
SEQID 
NO: 


i 


77481 


7792 




!§ 




1 


i 


i 


i 




4023 


4023 


i 


1 












1122 


6150 


7097 


1 


i 


9253] 



372 



/;/E:A^¥O01 57274 cpc; 



WO 01/57274 



PCT/USOl/00666 



li 



m 



UP 



i 1 



go 



373 



wo 01/57274 PCT/USOl/00666 



Top Hit DesCTiptor 


Homo sapiens myosin, heavy polypeptide 13, skeletal muscle (MYH13), mRNA | 


RG4-HT0578-1 70300-012-g02 HT0578 Homo sapiens cDNA | 


RC4-HT0578-1 70300-01 2-g02 HT0578 Homo sapiens cDNA | 


Homo sapiens synaptic glycoprotein SC2 (SC2) mRNA, complete cds | 


Homo sapiens synaptic glycoprotein SC2 (SC2) mRNA, complete cds | 


MR0-CT0063-071 099-002-h1 1 CT0063 Homo sapiens cDNA 


Homo sapiens ribosomal protein L13a (RPL13A), mRNA | 


ws55c06.x1 NCI CGAP_Brn25 Homo sapiens cDNA clone IMAGE:2501 098 3' similar to TR;Q59050 1 
Q59050 HYPOTHETICAL PROTEIN MJ1556. ; | 


1 

g 

z 

I 
t 

1 

i' 
1 


Homo sapiens iysozyme homolog (LOC571 51 ), mRNA 1 


Homo sapiens vacuolar ATPase isoform VA58 mRNA, complete cds 1 


bb62aOS.y1 NIH„MGC„9 Homo sapiens cDNA clone IMAGE;3030034 5' similar to gb:X04098_cds1 ACTIN, 
CYTOPLASMIC 2 (HUMAN); gb:M214e5 Mouse cyfoskeletal gamma-actin mRNA, complete cds (MOUSE); 


Homo sapiens Interleukin 12 receptor, beta 1 (IL12RB1), mRNA 1 


Homo sapiens Interleukin 12 receptor, beta 1 (IL12RB1), mRNA 1 


Homo sapiens DNA for Human P2XM, complete cds | 


Homo sapiens thyroid autoantigen 70kD (Ku antigen) (G22P1), mRNA [ 


Homo sapiens hypothetical protein FLJ20309 (FLJ20309), mRNA 1 


Homo sapiens chromosome 21 segment HS21C006 f 


Homo sapiens chromosome 21 segment HS21 C082 [ 




QV0-HT0494-020300-137-d03 HT0494 Homo sapiens cDNA 


Homo sapiens HELG protein (FAM4A1), mRNA 1 




Homo sapiens heme-bindIng protein (HEBP), mRNA | 


Homo sapiens BASS1 (BASS1) mRNA, partial cds ] 


RC3-NN00SS-270400-01 1 -c04 NN0066 Homo sapiens cDNA | 




z 

(C 
E 

1 
t 

1 

i 
1 

1 




Top Hit 
Database 
Source 


NT 1 


EST HUMAN | 


EST_HUMAN | 




NT 1 


EST HUMAN 1 


NT 1 


EST HUMAN 1 


EST HUMAN 1 


NT 1 


NT 


2 




NT 1 


INl 


z 


z 


NT 




NT 1 


1 


NT 


Z 


NT 1 


z 


EST_HUMAN ] 


NT 1 


NT ] 




Top Hit Acession 
No. 


11321 578| 


BE1 75434.1 


BE1 75434.1 


i 
< 


AF222742.1 | 


AW374968.1 1 


11 424099 1 


AW071 755.1 


AI024877.1 


1 

! 


AF113129.1 1 


i 


115260371 


115260371 


AB002059.1 1 






AL1 63206.2 1 


AL1 63282.2 | 


AL1 6321 8.2 | 


BE166574.1 1 


i 


114359131 


11435913] 


i 
< 


1 


U01317.1 ] 


4502582] 




Similar 
P) Hit 
.STE 
Jlue 




.OE-72 1 


.OE-72 




.0E-72| 


.OE-73 




.OE-73 1 




.OE-73 


.OE-73 1 






.OE-73 1 


.OE-73 1 






.OE-73 1 


.OE-73 1 


.0E-73| 


.OE-73 1 


.OE-73 1 




.OE-73 1 


.OE-73 1 


.OE-73 1 


.OE-73 1 


i. 


.OE-73 






























































Expression 
Sign^ 




3.82 1 


3.82| 






1.23 


?3' 




1 






1 






2.121 












3.36| 






0.99] 










0.96 


ORF SEQ 
ID NO: 


1 




26534| 


i 


27595] 


21213 




SI 


i 


§ 




27459 


276821 


276831 


252821 


25235| 


1 








262741 






i 








§ 














7383 


1348 


8197 






i 


S 


17254 




7465 




i 




s 


4743 




6121 






1715 


0760 




2135 


1 


i 






























































Probe 
SEQID 
NO: 




65011 


6501 


75321 


7532] 


1443 


8320 


1022 


i 




67151 


7385 




76141 








32611 


1 




i 


52151 


1818 


1818] 




1902 


2251] 


3144 


3503 1 
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/;/E:A^¥O01 57274 cpc; 

WO 01/57274 



PCT/USOl/00666 



II 



II 



■11 



UP 



376 



wo 01/57274 PCT/USOl/00666 




377 




378 



wo 01/57274 



PCT/USOl/00666 



379 



/;/E:A^¥O01 57274 cpc; 



WO 01/57274 PCT/USOl/00666 




/;/E:A^¥O01 57274 cpc; 



WO 01/57274 



PCT/USOl/00666 



381 



wo 01/57274 



PCT/USOl/00666 



^ 1; 

lib 



II 

S3 



riiH 



■S - uj 



382 



wo 01/57274 PCT/USOl/00666 



Top Hit Descriptor 


hD43b05.x1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone iMAGE:30401 13 3' similar to 1 
SW:GAG2_HUMAN P102S4 RETROVIRUS-RELATED GAG POLYPROTEIN ; 


tw22g02.x1 NCI_CGAP_Bm52 Homo sapiens cDNA clone IMAGE:2260466 3' similar to TR:065245 1 
065245 F21 El 0.7 PROTEIN . ; | 


tw22g02.x1 NCLCGAP Bm52 Homo sapiens cDNA clone IMAGE:2260466 3' similar to TR:065245 
065245 F21E10.7 PROTEIN. ; | 


nuclear gene encoding mitocliondrial protein, mRNA | 


ns58g12.s1 NCl_CGAP_Pr2 Homo sapiens cDNA clone IMAGE;1 188838 similar to SW:RL29_HUMAN 
P47914 60S RIBOSOMAL PROTEIN L29. [1] ;contains element MSRI repetitive element ; 


i 

1 
< 

1 

! 

X 

S 
I 


601476802F1 NIH_MGC_68 Homo sapiens cDNA clone IMAGE:3879505 5' | 


at74a09,x1 Barstead colon HPLRB7 Homo sapiens cDNA clone IMAGE:2377720 3' similar to TR:Q1331 1 1 

Q13311 TAXl-BiNDiNG PROTEIN TXBP151. [1] ; | 


Human protein l^inase 0 substrate 80K-H (PRKCSH) gene, exon 7 | 


Human protein l<inase C substrate 80K-H (PRKCSH) gene, exon 7 | 


1 Homo sapiens mRNA for K1AA1276 protein, partal cds | 


Homo sapiens mRNA for KIAA1276 protein, partial cds . | 


Homo sapiens amyloid beta (A4) precursor protein (protease nexin-l 1, Alzheimer disease) (APP), mRNA | 


Homo sapiens amyloid beta (A4) precursor protein (protease nexin-ll. Alzheimer disease) (APP), mRNA | 




Homo sapfens amyloid beta (A4) precursor protein (protease ne)dn-ll, Alzheimer disease) (APP), mRNA 


Homo sapiens mRNA for KiAA1 1 01 protein, complete cds ( 


Homo sapiens 2,4-dienoyl CoA reductase 1 , mitochondrial (DECR1), mRNA | 


E 
f 


|Homo sapiens 959 kb contig between AML1 and CBR1 on chromosome 21q22; segment 1/3 | 


Homo sapiens breast cancer 1 , early onset (BRCA1 ), transcript variant BRCA1 -exon4, mRNA | 


Cardlomvopathy associated gene 5 | 


Top Hit 
Database 
Source 




|est human 1 


|est_human 1 


z 


EST HUMAN 


IeST HUMAN 1 


|est_human 1 


EST HUMAN 1 


z 


z 






z 


z 






z 


z 


z 


z 




EST HUMAN | 


Top Hit Acession 
No. 


BE044316.1 


|aI613519.1 


< 




i 

1 


i 

S 


|BE787143.1 


|aI833003.1 1 


IU50321.1 1 


IU50321.1 1 


|AB033102.1 1 


IAB033102.1 1 


? 


g 
S 


45021661 


i 


IAB029024.1 | 


i 


77062991 


IAJ229041.1 1 


1 


AW755254.1 | 


Most Similar 
(Top) Hit 

Value 


s 






2.0E-77 






1 2.0E-77I 






i 








12 


12 














I.OE-77I 


Expression 
Signal 




d 


d 


d 






























CM' 








ORFSEQ 
ID NO: 


i 




i 




24334 




25761 1 




27564| 


1 


i 




S 


§ 




1 




1 227271 




8 


1 


2481o| 


Exon 
SEQ ID 
NO: 


13855 






1 






1 




S 




§ 


10024| 








1 


12272 




1 141551 




1 


I5043I 


Probe 
SEQ ID 
NO: 


3947 


1 4315 




1 


1 


1 56251 


1 5745 1 


1 


1 


1 7489 1 
















3007 


4256 


1 







383 
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PCT/USOl/00666 




384 




385 



wo 01/57274 



PCT/USOl/00666 



1 




386 



/;/E:A^¥O01 57274 cpc; 

WO 01/57274 



PCT/USOl/00666 



If 



is 



387 



wo 01/57274 



PCT/USOl/00666 



00 ^ UJ 



III 



389 



wo 01/57274 PCT/USOl/00666 



Top Hit Descriptor 


hn98d02.x1 NCI CGAP Co14 Homo sapiens cDNA clone IMAGE:3035907 3' similar to SW:COPG_BOVIN 1 
P53620 COATOMER GAMMA SUBUNIT ; 


Homo sapiens mRNA for KIAA1345 protein, partial cds 1 


ws90h03.x1 NCI CGAP_Co3 Homo sapiens cDNA clone IMAGE:2505269 3' similar to TR:04381 5 04381 5 1 
STRIATIN. ; 


Homo sapiens rab3 interacting protein variant 2 mRNA, partial cds | 


Homo sapiens rabS interacting protein variant 2 mRNA, partial cds 1 


Human mRNA for amyloid A4(751) protein I 


Human cone photoreceptor cGMP-phosphodiesterase alpha' subunit gene, exons 2 and 3 1 




Homo sapiens mRNA for Death-associated protein kinase 2, complete cds 1 


Homo sapiens ligase 1, DNA, ATP-dependent (LIG1), mRNA | 


Homo sapiens vesicle trafficking protein sec22b (SEC22B) mRNA 1 


1 

i 
1 
1 

1 
1 
§ 


Homo sapiens calcineurin binding protein 1 (KIAA0330}, mRNA j 




Homo sapiens beta-ureidopropionase (L0C51 733), mRNA 


Homo sapiens transcobalamin II; macrocytic anemia (TCN2), mRNA | 


1 Homo sapiens N F2 gene | 


iHomo sapiens NF2 gene I 


Homo sapiens cullin 4A (CUL4A) mRNA, complete cds I 






1 Homo sapiens chromosome 21 segment HS21 C083 1 


|60:474072F1 NIH_MGC_68 Homo sapiens cDNA clone IMAGE:3877121 5' 


|60' 474072F1 NIH_MGC_68 Homo sapiens cDNA clone IMAGE:3877121 5' 1 


1 hg85c01 .x1 NCI CGAP_Kid1 1 Homo sapiens cDNA clone IMAGE:2952384 3' 




1 hg85o01 .x1 NCI CGAP Kid1 1 Homo sapiens cDNA clone IMAGb:29b2384 3' 


|Homo sapiens period (Drosophila) homolog 3 (PER3), mRNA 1 




III 


EST HUMAN | 




EST HUMAN 1 


^ 












& 


















z 


z 




z 


lEST HUMAN 1 


lEST HUMAN 


lEST HUMAN 




lEST HUMAN | 




|EST HUMAN 


TopHitAcession 

No. 


|aW779612.1 1 


i 

8 


1 


IAF263306.1 1 


8' 
< 




U20197.1 


I 


|AB018001.1 1 


11425281 


1 


1 


r 


1 11417871 1 




1 114179741 


Y18000.1 


Y18000.1 


i 
< 


i 
1 


1 


lALI 63283.2 1 




|BE784636,1 


|AW61 1542.1 1 


1 5453871 


|AW61 1542.1 


1 8567387 


IW26539.1 


Most Similar 
(Top) Hit 
BLAST E 




s 


4.0E-81 


4.0E-81 


s 


3 




3 


3 




3 


3 


3 


f <? 




i 


3 


3 


3 


3 




i 


2,0E-81 




3 


1 2.0E-81 


2.0E-81 


3 




Expression 
Signal 




«• 


























s 
















f: 






d 








ORF SEQ 
ID NO: 


i 






23746 


23747 






27042 


27396 








25007 


25253 


25254 


25228 






22100 


1 






22530 


22531 


234D4 


24256 






1 


D 

S a o 






1 










6860 


S 


1 










E 








i 




i 






12732 








19375 
































































Probe 
SEQ ID 
NO: 


1779 


3134 






1 


i 






e 




8522 


- 85?? 




I i 


i 


s 


r^?46 


1246 




1 


i 




i 


2802 


3707 


1 


i 







390 
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Top Hit Descriptor 


EST372729 MAGE resequences, MAGF Homo sapiens cDNA | 


2k45h09.r1 SoaresjDregnant uterus NbHPU Homo sapiens cDNA clone IMAGE:485825 5' similar to 1 
PIR:S52437 S52437 CDP-diacylglycerol synthase - fruit fly ; | 


tz45cD4.y1 NCI CGAP Brn52 Homo sapiens cDNA clone IMAGE:2291 526 5' | 


Human aconitate hydratase (AC02) gene, exon 3 | 




Homo sapiens polymerase (DNA directed), gamma (POLG), mRNA • | 




6021 37864F1 N IH_MGC_83 Homo sapiens cDNA clone IMAGE:4274535 5' | 


Homo sapiens polymerase (DNA directed), gamma (POLG), mRNA | 


N 1H_MGC_56 Homo sapiens cDNA clone IMAGE:3930228 5' | 


! 

o 

i 
1 

z 

1 
i 

X 

S| 
o 


N IH MGC 53 Homo sapiens cDNA clone IMAGE:3685483 5' | 


NIH MGC 9 Homo sapiens cDNA clone IMAGE;3838280 5' | 


N IH_MGC_9 Homo sapiens cDNA clone 1MAGE:3838280 5' | 


CM3-NN0059-140400-147-a12NN0059 Homo sapiens cDNA | 




MR0-CT0006-250599-019 CT0006 Homo sapiens cDNA |. 


EST372729 MAGE resequences, MAGF Homo sapiens cDNA | 


601 86771 4F1 N 1H_MGC_1 7 Homo sapiens cDNA clone IMAGE:41 1 0459 5' |i 




Homo sapiens HSPC288 mRNA, partial cds (' 


Homo sapiens HSPC2SS mRNA, partial cds \, 


Human CRFB4 gene, partial cds 


Human CRFB4 gene, partial cds |- 


Human GRFB4 gene, partial cds |, 


Homo sapiens mRNA for KIAA1327 protein, partial ods i; 


mRNA \ 


Homo sapiens hypothetical protein FU20461 (FLJ20461), mRNA |i 


§ 

1 

1 
< 

z 

i 

i 

0 
I 


1 

Q 
.1 

! 

i 
I 

1 




601 645051 F- 


1 


§ 


601577339F- 


§ 


Top Hit 
Database 
Source 


EST HUMAN 1 


EST_HUMAN | 


EST HUMAN | 


NT 1 


NT 1 


NT _ 


NT 




NT 1 


EST_HUMAN 1 


z 
1 

X 


Z 
g 
X 


EST HUMAN | 


EST HUMAN 1 


EST HUMAN 1 


z 

1 

I 


EST_HUMAN 1 




EST HUMAN | 


NT 1 


NT 1 


NT 1 


NT 




NT 


NT_ 




NT ^ 


EST HUMAN | 


EST HUMAN | 


EST_HUMAN | 


TopHitAcession 
No. 


§ 
< 


AA040370.1 


BE047996.1 1 


Ua7928.1 1 


i 


§ 


U52351.1 1 


BF674641.1 






BE958278.1 | 


BE564367.1 | 


BE744545.1 | 


BE744545.1 1 


AW897550.1 1 


i 

& 
< 


i 
< 


AW960658.1 1 


BF204253.1 | 


114181381 


API 51 406.1 1 


AF161406.1 1 


i 


i 


i 


i 
< 


\ 671 5601 1 




i 


AU144050,1 1 




IVIost Similar 
(Top) Hit 
BLAST E 
Value 


1. OE-81 


2 


.OE-81 


=9 


.OE-81 


3 


■OE-81 


! S 


.OE-81 


.OE-81 


.OE-81 


.OE-81 


.OE-81 


3 




3 


3 


3 


.OE-81 


3 


8.0E-82 


8.0E-82 


^ 


8.0E-82 


8.0E-82 


3 


8.0E-82 


8.0E-82 


7.0E-82 


7.0E-82 


5.0E-82 








































Expression 
Signal 






S 
















2.62 


4.81 1 


8 


^ 








2.42| 
























« 




ORFSEQ 
ID NO: 


233041 


1 


242221 


29103 


249821 


1 




25750 




277021 


27703] 


277991 


1 


27910 


i 




i 


233041 


1 


IS 


i 


i 


200441 


i 




212441 


1 21400 


23825, 






23713| 


Exon 
SEQ ID 
NO; 




J 


1 






5206 


S I 
S £ 


8 i? 
? S 


6425 


7482 


f 


7576 




1 


7850 


8312 




3516 


i 


g 


i 


1 




0726 


g 




11540 


S 


1339 


§ 


g 






























































Probe 
SEQ ID 
NO; 


i 




1 


52031 


i 




is 


5737 


i 


76311 


7631 


77261 


7819 


78191 


80001 


1 


8438 


i 




i 










i 


1475 




4150 


14341 




4034 



391 
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1 

a 
I 


[Homo sapiens alpha-tubulln isoform 1 mRNA, complete cds | 


wp75e09.x1 NCI CGAP Bm25 Homo sapiens cDNA clone IMAGE:2457624 3' similar to TR:0 75276 1 
075276 PKD1 ; | 


Homo sapiens presenilin-1 gene, exons 1 and 2 | 


Ihomo sapiens amyloid beta (A4) precursor protein (protease nexin-l 1 , Alzheimer disease) (APP), mFiNA ] 


|RC2-BN0120-010400-013-f02 BN0120 Homo sapiens cDNA 1 


|Homo sapiens transforming growtii factor beta-activated kinase-binding protein 1 (TAB1 ), mRNA | 


Homo sapiens amyloid beta (A4) precursor protein (protease nexin-ll, Alzheimer disease) (APP), mRNA 


|ai23e05.s1 SoaresJesfis_NHT Homo sapiens cDNA clone 1343648 3' 1 


RCe-PT0001-190100-021-B02 PT0D01 Homo sapiens cDNA | 


Homo sapiens chromosome 21 segment HS21 0085 | 


1 

a 
1 
t 
1 

1 
c6 

1 
1 








S 

pi 

1 

11 
ip 
2 0 

Ii 

ii 

It 

n, 

g < 

u 




IHomo sapiens mRNA for KIAA1077 protein, partial cds 1 


Homo sapiens mRNA for KIAA1 077 protein, partial cds 1 




|Homo sapiens mRNA for K1AA0999 protein, partial cds | 


i 

1 
1 

1 

1 

1 

D 


Homo sapiens DNA for amyloid precursor protein, complete cds | 


Homo sapiens glutamate receptor, ionotropic, kainate 1 (GRIK1 ) mRNA | 


Homo sapiens mRNA for KIAA1 095 protein, partial cds | 


Homo sapiens mRNA for KIAA1096protein, partial cds . 1 


Homo sapiens wbsorl (WBSCR1) and wbscrS (WBSCRS) genes, complete cds, alternatively spliced and 1 
Ireplication factor C subunit 2 (RFC2) gene, complete cds 








Source 


z 


EST HUMAN | 


z 


2 


EST HUMAN 1 


z 




EST HUMAN 1 


EST HUMAN | 




EST HUMAN 1 






EST HUMAN 


z 




z 




z 


EST HUMAN 1 


z 


z 


z 


z 


z 




z 


TopHitAcession 




|AF081484,1 1 


i 
< 


IAF029701.2 1 




|BE006705.1 1 


1 51747021 


4502166 


|AA725848.1 | 


|AW875073.1 | 


|AL1 63285.2 | 


|BE813232.1 1 


4501922 




AA1 35979.1 


1 












i 

Q 


4504116] 


< 


< 


AF045555.1 


4507580] 


1 




i 
< 


i 


< 


1AL046390.1 


t\flost Similar 
(Top) Hit 
BIJ^STE 
Value 






1 4.0E.82I 


3.0E-82 


3 


3 


3.0E-82 


3 


3.OE-82I 




3.0E-82I 


3.OE-82I 


3.0E-821 


3.0E-82 


3.0E-82I 


3.0E-82I 




2.0E-82I 


3 


3 


3 


2.0E-82I 


2.0E-82] 


2.0E-82| 


2.0E-82 


2.0E-82| 


3 


1 


1 
w 








1 I3.75I 






















S 
















§ 


































ORFSEQ 


ID NO: 


1 2141 2| 






1 20061 


1 204461 


R 


§ 






1 212191 


1 21 630 1 






1 


i 


1 27728] 




1 20326| 


i 




i 






] 24144] 


1 


1 246531 


i 


SEQ ID 
NO: 




i 




s 


§ 


8 






1 








13158 


14718 




17504 


i 


a 


§ 


i 


i 


14040 


14352 


14352 


i 


1 


i 






















































Probe 
SEQ ID 
NO: 




i 












1045 


s 




1 


i 


1 3234| 


1 


i 


i 


i 




s 


i 




4140 






1 


s 


s 
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I 


iHomo sapiens mRNA for KIAA0727 protein, partial cds | 


Homo sapiens FAM4A1 splice variant a (FAM4A1 ) mRNA, complete cds | 


Homo sapiens slit (Drosophila) homolog 3 (SLITS), mRNA | 






|Homo sapiens CAGF9 mRNA, partial cds | 


|Homo sapiens CAGF9 mRNA, partial cds | 


1 

1 
< 

g 
X 

i 

3 
3 
& 
f 

s 
1 


Ui 

1 

1 
< 

§ 
I 

i 

1 

i' 
i' 

1 
1 




Homo sapiens melanoma differentiation associated protein-5 (MDA5), mRNA | 


i 
i 

1 
< 

1 

8' 

Z 

i 

i 


RC4-BT031 0-1 10300-015-flO BT0310 Homo sapiens cDNA | 


Homo sapiens mRNA for KIAA0538 protein, partial cds | 


UI-H-BW1-aoa-f-03-0-Ul.s1 NCLCGAP_Sub7 Homosapiens cDNA clone IMAGE:3084053 3' 1 


Homo sapiens chromosome 21 segment HS21 C009 | 




602150403F1 NIH MGC 81 Homosapiens cDNA clone IMAGE:4291 561 5' | 


60127334SF1 NIH_MGC_20 Homo sapiens cDNA clone IMAGE:3614362 5' 1 


za48f12.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:295823 3' | 


no12h01 .s1 NCI_CGAP_Phe1 Homo sapiens cDNA clone IMAGE;1 1 00497 3' similar to contains Alu 1 
repetitive element; | 


7p37a07.x1 NCI_CGAP_Pr28 Homo sapiens cDNA clone IMAGE:3647893 3' similar to TR:Q9Y3I6 Q9Y3I5 1 
DJ207H1.1 ; | 


Human platelet Glycoprotein lib (GPIIb) gene, exons 2-29 | 


hf31 h03.x1 Soares NFL T GBC SI Homo sapiens cDNA clone IMAGE:2933525 3' similar to | 
SW:YBEB HAEIN P44471 HYPOTHETICAL PROTEIN HI0034. ; | 


QV4-ST0234-181 199-037-f05 ST0234 Homo sapiens cDNA | 


? 
0 

i 

1 
< 

I 
1 

j 

1' 
1' 
s' 
g 

'■3 
p 




|Homo sapiens VAMP (vesicle-associated membrane protein)-assooiated protein A (33kD) (VAPA) mRNA, 1 
and translated products | 


Homo sapiens hyperlon gene, exons 1-50 | 




Top Hit 


Source 




z 




z 




z 




EST HUMAN 1 


s 




z 


[EST HUMAN 1 


|EST HUMAN | 


z 


Z 


z 


z 


|EST HUMAN | 


lEST HUMAN 1 


lEST HUMAN 1 


EST HUMAN | 


EST HUMAN 


z 


EST HUMAN | 


z 

i 


|EST_HUMAN | 










TooHitAoession 




IAB018270.1 1 


IAF234882.1 | 


1 113215701 


IY08032.1 1 


IY08032.1 1 


§ 


1 


IN94950.1 1 


1AA01 1278.1 1 


1 114180971 


1 11 545921 1 


IBE885106.1 1 


i 


IAB011110.2 1 


|BF515938.1 1 


lALI 53209.2 J 


< 


1BFS72220.1 | 


IBE383973.1 1 


1 


< 


BF221813.1 1 


IM33320.1 1 


< 


1 
1 


IAA701457.1 J 


114302411 


1 


!AJ01 0770.1 1 


11422024| 


s X 

W -Q 


BLAST E 
Value 


S 


2.0E-82I 




2.0E-82I 


2.0E-82I 


2.0E-82I 


^ 


2.0E-82I 


2.0E-82I 


2.0E-82I 


1.0E-82| 


1.0E-82| 


1.0E-82| 


9 


1 .OE-821 


1.0E-82| 


1 .OE-82 


9.0E-83I 


? 


8.0E-83| 


7.OE-83I 


7.0E-83 


6.0E-83I 


6.OE-83I 


6.0E-83| 


6.0E-83I 


6.0E-83I 


6.OE-83I 


6.0E-83| 


6.0E-83| 


















■n 
1 




S 


1 2.23| 






1 


1 


1 2.23 1 
















1 2.571 


1 1.761 




» 






5,49 


1 1.341 






1 0.941 


1 








ORFSEQ 


O 


1 25108| 


S 


1 


1 279121 


i 


1 


1 








203211 




210201 






1 282641 


1 285001 




21155 












22709 


1 22751] 


1 23231 1 


1 24923! 


1 25679 




Exon 
SEQ ID 
NO: 


15278 


15654 


16713 


17671 


s 


18493 




1 


1 




91-901 


§ 








s 


18250 


i 


1-1297 


12647 


12764 


§ 


10343 


11547 






13431 


15160 


15579 


1 






























































Probe 
SEQ ID 
NO: 


1 5358 




i 


1 78211 


1 


i 




1 








1190 


1264 


1265 




S 


1 83731 


1 70571 


13921 


i 


i 


4710 






i 


3031 




1 5236 


i 
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Top Hit Descriptor 




|wi67ii08.x1 NCI_CGAP_Kid12 Homo sapiens cDNA cione IMAGE:2398431 3' similar to contains element 1 
|mSR1 repetitive element; | 




|601452817F1 NIH_MGC_67 Homo sapiens cDNA clone IMAGE:3866021 5' | 


O 
1 

1 

i 
1 
1 

i' 
1 


|601109738F1 N1H_MGC_1 6 Homo sapiens cDNA clone IMAGE:3350553 5' | 


i 
< 

i 

1 

z 

1 

i' 
i' 

1 

1 


i 

a 

i 
! 

z 

1 

i' 
i' 

I 

1 


|601897003F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE:4126440 5' | 


601897003F1 NIH MGC 19 Homo sapiens cDNA clone IMAGE:4126440 5' 


Homo sapiens calcineurin binding protein 1 (KIAA0330), mRNA | 


601120778F1 NIH_MGG_20 Homo sapiens cDNA clone IMAGE:2967690 5' 


1 Homo sapiens KIAA0680 gene product (KIAA0680), mRNA 1 




|aj88f08.s1 Soares_parathyroid tumor NbHPA Homo sapiens cDNA clone IMAGE:1403559 3' | 


|Homo sapiens Taxi (iiuman T-cell leul<emia virus type 1) binding protein 1 (TAX1BP1), mRNA | 


Homo sapiens galaclocerebrosidase (GALC) gene, exon 1 5 | 


1 Homo sapiens RAN binding protein 7 (RANBP7), mRNA \ 


Homo sapiens DiGeorge syndrome critical region gene 6 (DGCR6), mRNA p 


1 Homo sapiens similar to transcription factor CA1 50 (H. sapiens) (LOG631 70), mRNA |= 


Homo sapiens similar to transcription factor CA1 50 (H. sapiens) (LOC631 70), mRNA \ 


|Homo sapiens oxoglutarate deliydrogenase (lipoamide) (OGDH) mRNA ^ 


I601072594F1 NIH_MGC_12 Homo sapiens cDNA clone IMAGE:3458830 5' |; 


|60117S865F1NIH_MGC_17HomosapienscDNAclonelMAGE:3531953 5' f 




1 

1 

< 

1 

t 

1 

o 
o 

o 
z 


|601 509696F1 NIH MGC 71 Homo sapiens cDNA clone IMAGE:3911303 5' p 




|tu18b02.x1 NCIj:GAPJ'r28 Homo sapiens cDNA clone IMAGE:2251 371 3' |^ 


|601 302333F1 NIH MGC 21 Homo sapiens cDNA clone IMAGE:3636753 5' |. 


|EST1 77232 Jurkat T-cells VI Homo sapiens cDNA 5' end \i 


1 Homo sapiens chromosome 21 segment HS21 C003 \j 


Top Hit 
Database 
Source 




EST HUMAN 




|EST HUMAN | 






|EST HUMAN 


lEST HUMAN 


lEST HUMAN 1 


EST HUMAN 


E z 


EST HUMAN 


z 


[EST HUMAN 


|EST HUMAN | 
















lEST HUMAN 1 


|EST HUMAN 1 


|EST HUMAN | 


iEST_HUMAN 1 


lEST HUMAN 1 


lEST HUMAN 


|EST_HUMAN 


lEST HUMAN 


|EST HUMAN | 


z 


TopHitAcession 
No. 


|AL1 63284.2 | 


|a|760820.1 


IBE794306.1 | 


IBE618392.1 | 


i 


IBE257917.1 | 


IAA778785.1 | 


1 
1 


|BF31 1552.1 1 


BF31 1552.1 


11417862 


IBE274217.1 1 




8 
8 

1 


I 




jL38557.1 1 


\ 54539971 




1 114170121 






|BE547173.1 1 


|BE295843.1 1 


1BE547173.1 | 


1 AW 340946.1 1 




s 
a 


1 


|BE41 0354.1 1 


|AA306264.1 | 


8 

5 


Simiiar 
p)Hit 
,STE 
3lue 


2.0E-85 


£ 


1. OE-85 1 




£ 


1 .OE-85 


198-301 




1. OE-85 1 


1. OE-85 1 


1. OE-85 


).0E-86 


? 


s 


'.OE-86 1 




s 


» 


i 


'.OE-86 


'.0E-86| 


).0E-86| 


S 


i.0E-86| 






i.0E-86| 


3 


S 


i.0E-86l 


!.0E-86| 


1 




































































Expression 
Signal 
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ORF SEQ 
ID NO: 




i 




1 221211 




1 27704j 




i 


284921 




252831 




20001 


g 


206921 


E 
% 


1 




§ 


1 


284561 












1 


1 279641 


1 28125! 










14705 


1 


12125 


1 


12224 


1 






18242 


18242 






1 


10845 


10845 




i 








18206 




10175 


1 








17718 


1 


09961 


i 


,0354 




































































Probe 
SEQ ID 
NO: 








1 2344 


i 


1 7632 


1 


3 


1 


11 












6164 


1 7078] 


7593 


1 76241 


8329 








1 56771 




1 


1 


1 


i 


1 9163 




1 
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1 0547781 


1 0547781 














t 
1 












































i:05477a 


9. 














cds, altei 








3 

k 

E 


S 

M 
1 


































34S97 3' similar to TF 


34897 3' Similar to TF 










i 




nns 7-49, and partial 








elMAGE:212190 3's 
E, CYTOPLASMIC ; 


elMAGE:212190 3'E 
E, CYTOPLASMIC ; 
















< 










0 

1 
§ 
0 


CD 

i 
1 




Top Hit Descriptor 


Irir81d09.x1 NCI CGAP Kid1 1 Homo sapiens cDNA clone IMAGE:31 
SOLUTE CARRIER FAMILY 22 -LIKE 2 PROTEIN ; 


hr81d09.x1 NCI_CGAP_Kld11 Homo sapiens cDNA clone iMAGE:31 
SOLUTE CARRIER FAMILY 22 -LIKE 2 PROTEIN ; 


[Homo sapiens chromosome 21 segment HS21 C046 


IHomo sapiens chromosome 21 segment HS21 C046 


Homo sapiens chromosome 21 segment HS21 C046 






I7e36f08.x1 NCI_CGAP_Lu24 Homo sapiens cDNA clone IMAGE:328 


Homo sapiens calcium channel alphalE subunit (CACNA1E) gene, e> 


ai63d08.s1 Scares Jestis^NHT Homo sapiens cDNA clone 1375503 0 


|501655837R1 N1H_MGC_66 Homo sapiens cDNA clone IMAGE:385I 


|601655837R1 NIH MGC 66 Homo sapiens cDNA clone 1MAGE:385I 


yr86e04.El Scares fetal liver spleen 1 NFLS Homo sapiens cDNA clon 
SP:C1TC_HUMAN P11586 C-1-TETRAHYDROFOLATE SYNTHAS 


|yr86e04.s1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA clon 
SP:C1TCJUMAN P11586 C-1-TETRAHYDROFOLATE SYNTHAS 


i 


H. sapiens ECE-1 gene (exon 6) 


|Homo sapiens hypothetical protein FLJ10388 (FLJ10388), mRNA 


Homo sapiens hypothetical protein FLJ10388 (FLJ10388), mRNA 


|Homo sapiens HsGCNI mRNA, partial cds 


|Homo sapiens HsGCNI mRNA, partial cds 


iHomo sapiens inositol 1,4,5-friphosphate receptor, type 3 (ITPR3) mR 




|Homo sapiens TCL6 gene, exon 1-10b 




Homo sapiens intersectin long isofomi (ITSN) mRNA, complete cds 


Homo sapiens pregnancy^zone protein (PZP) mRNA 


j 

1 

i 

i 

! 
1 


Q 

.1 
t 

i 

i 

i 

i 

i' 
« 




Top Hit 
Database 
Source 


EST HUMAN | 


EST_HUMAN 


iNT 1 


iNT 1 






i 


[EST HUMAN 1 


u 


|EST_HUMAN | 


[EST HUMAN 1 


|EST HUMAN | 


EST HUMAN 


EST HUMAN | 


iNT 1 


iNT 1 


iNT 1 








iNT 1 


:nt 1 


iNT 1 


'm \ 


NT 


!NT 1 


|EST_HUMAN 1 






p HitAcession 
No. 


i 


1 




163246.2 1 


163246.2 1 


i 


i 


s 




i 




3S2525.2 1 


i 




1926.1 1 






1 


i 


^700.1 1 


45047941 


45047941 


D35344.1 1 


i 




g 


1 


e 


135549.1 1 






BF- 


< 


< 


< 




BB 


BE( 


af: 














§ 














s 


AF' 




aa; 


aa; 




Most Similar 
(Top) Hit 
BLAST E 
Value 




S 


9.0E-90I 


9.0E-90I 


3 


8.0E-90I 


I 

(6 


8.0E-90I 


7.0E-9o| 


7.0E-90| 




7.0E-901 


3 


3 


3 


6.0E-90I 


6.OE-90I 


3 


6.OE-90I 


6.OE-90I 


6.OE-90I 


S 


5.0E-90I 


5.0E-90I 


5.0E-90| 


5.0E-90I 


5.0E-90I 


5.0E-90| 


5.0E-90I 


Expression 
Signal 


















2.65 












1 1.141 




1 7.331 








1 3.251 


1 3.251 


1 10.51 






p 






d 


ORFSEQ 
ID NO: 


1 


i 


1 


§ 


i 




s 


§ 






27250 


1 27251 1 


27927! 




1 227491 


1 227501 


1 














1 209311 


1 


1 24124| 


1 




1 


.. 

S oo 


1 




§ 


1 


§ 


1 


i 


1 




i 


§ 


§ 


1 


1 




12957 


14034 




15554 


1 


16725 


1 


10123 


11087 


s 


143341 


1 


1 


1 






























































Probe 
SEQID 
NO: 


1 


i 


i 


i 


1 10471 


1 




1 13081 




i 


1 71831 


1 7183] 


7833 


1 7833! 


1 3029 1 


1 


1 4134| 


1 41341 


1 


i 


1 


1 




1175 




1 






i 
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Top Hit Descriptor 


H.sapiens mRNA encoding phospholipase c | 




Homo sapiens angiopoietin 4 (ANG4) mRNA, parflal cds 1 


Homo sapiens adenylate cyclase 9 (ADCY9) mRNA | 


Homo sapiens hypothetical protein FU13222 (FU13222), mRNA 1 


mRNA 1 




Homo sapiens gene for AF-6, complete cds | 


ar78h05.x1 Barstead aorta HPLRB6 Homo sapiens cDNA clone IMAGE:21 28761 3' 1 


Homo sapiens chromosome 21 unknown mRNA | 


Homo sapiens chromosome 21 unknown mRNA | 


Homo sapiens myosin phosphatase, target subunit 1 (MYPT1). mRNA 1 


H.sapiens gene encoding discoidin receptor tyrosine kinase, exon 16 , | 


Homo sapiens DNA for amyloid precursor protein, complete cds 1 


Homo sapiens mRNA for KIAA1244 protein, partial cds | 




601 335244F1 NIH MGC 39 Homo sapiens cDNA clone IMAGE:3689147 5' | 


601 067378F1 N1H_MGC_1 0 Homo sapiens cDNA clone IMAGE:3453834 5' ]: 






qc54c02jc1 Soares_placenta 8to9weeks 2NbHP8to9W Homo sapiens oDNA clone IMAGE:1 71 341 0 3' 1 
similar to SW :0LF3_M0USE P23275 OLFACTORY RECEPTOR 0R3. ; | 


Homo sapiens mRNA for KIAA0289 gene, partial cds 1 


Homo sapiens GRB2-related adaptor protein (GRAP) mRNA 1 


ba49d05.y3 NIH_MGC_10 Homo sapiens cDNA cione IMAGE:2899881 5' similar to TR:O75208 075208 1 
HYPOTHETICAL 35.5 KD PROTEIN. ; 


Homo sapiens similar to laminin receptor 1 (67kD, ribosomal protein SA) (H. sapiens) (LOC63484), mRNA || 


Homo sapiens similar to laminin receptor 1 (67kD, ribosomal protein SA) (H. sapiens) (LOC63484), mRNA [ 


AU118985 HEMBA1 Homo sapiens cDNA clone HEMBA1004795 5' |: 


Top Hit 
Database 
Source 


NT 1 


NT 1 




NT 




NT 1 


NT 1 


NT 1 


EST HUMAN 1 


NT 1 


NT 1 


NT 


NT 1 


NT 


NT 1 


NT 


EST HUMAN | 


1 

I 


NT 1 


NT 


'est HUMAN 1 


NT 


MT 


EST HUMAN 1 


NT 1 


NT 


EST_HUMAN | 


Top HitAcession 
No. 


Z16411.1 1 


Z16411.1 1 


i j 

< < 


1 
I ^ 


1 


1 II419429I 


11 433721 1 


AB011 399.1 1 


AI523366.1 | 


AF231 920.1 1 


AF231 920.1 1 


450531 B| 




D87675.1 1 


AB033070.1 1 


i 
1 


BE563833.1 | 




1 


50317481 


AI138213.1 1 






i 

1 


11 427320 1 


R 
1 


AU 118985.1 1 


Most Similar 
(Top) Hit 

Value 


5.0E-90I 


5.0E-90I 






s 


5.0E^0 


5.OE-90I 


5.OE-90I 


? 

s 


4.OE-9OI 


4.OE-90I 


4.0E-90| 


4.OE-90I 


4.OE-90I 


4.OE-90I 


4.0E-901 


3.OE-90I 




i 


2.OE-90I 


2.OE-90I 


2.OE-90I 


2.OE-90I 




2.OE-90I 


2.0E-90 


2.OE-90I 


Expression 
Signal 


2.52| 




^ ? 


^ 












1.851 




















3.651 


2.81 1 


1.131 


s 










ORFSEQ 
ID NO: 


25409| 


254091 


11 






27646| 


§ 






g 


200841 


1 




242351 


i 


24385 1 


290041 


1 


i 


209141 




24264| 








1 




Exon 
SEQID 
NO: 


153531 


s 
s 




16323 




1 




1 


i 






i 




1 


14577| 


14599| 








11069| 


s 


14476| 






1 


1 




Probe 
SEQID 
NO: 


1 5433 1 




ii 




i 




1 


9744 


i 






1070| 


1 


1 


1 


4713 


1 




1156 




1 37711 


1 


4820 


5534 


1 


i 


7722 
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Top Hit Descriptor 


IAU118985HEMBA1 Homo sapiens cDNA clone HEMBA1004795 5' | 




Homo sapiens amyloid beta (A4) precursor protein (protease nexin-II, Alzheimer disease) (APP), mRNA 


[Homo sapiens chromosome 21 unknown mRNA • | 


[Homo sapiens chromosome 21 unknown mRNA | 


1 Homo sapiens mRNA for T-box transcription factor (TBX20 gene), partial | 


|HomosapiensmRNAforT-boxtranscriptionfactor(TBX20gene), partial | 






1 Homo sapiens Kruppel-like factor 7 (ubiquitous) (KLF7), mRNA | 






|601159563F2NIH MGC 53 Homo sapiens cDNA clone lhMGE:351 11 18 5' | 


|Homo sapiens similar to SALL1 (sal (Drosophila)-like (LOC57167), mRNA | 




Homo sapiens mRNA for KIAA0903 protein, partial cds | 


1 Homo sapiens mRNA for KIAA0903 protein, partial cds | 


Homo sapiens soluble interleukin 1 receptor accessory protein (1L1 RAP) gene, exon 8, alternafive exons 9 1 
and complete cds, alternatively spliced f 


1 Homo sapiens mRNA for KIAA0633 protein, partial cds | 


mRNA I 


1 Homo sapiens brefeldin A-inhibited guanine nucleofide-exchange protein 2 (BIG2), mRNA 1 


Homo sapiens SNCA isoform (SNCA) gene, complete cds, alternatively spliced | 


|Homo sapiens CGl-15 protein (LOC51006), mRNA f 


1 Homo sapiens CGI-1 5 protein (LOC51 006), mRNA ) 


Homo sapiens DNA for Human P2XM. complete cds f 


1 Homo sapiens DNA for Human P2XM, complete cds |j 


|HUM000S381 Liver HepG2 cell line. Homo sapiens cDNA clone s381 3' | 


1 Rattus norveglcus brain specific cortactin-binding protein CBP90 mRNA, partial cds |i 


iHomo sapiens makorin, ring finger protein, 1 (MKRN1), mRNA I 




|AU143539 Y79AA1 Homo sapiens cDNA clone Y79AA1 002087 5' 1; 


Top Hit 
Database 
Source 


|EST HUMAN 1 




1- 
z 


z 




1- 
z 


z 


z 


z 






z 


lEST HUMAN 1 


Z 


z 






z 


z 






z 


z 






z 


]EST HUMAN | 


z 


z 


]EST HUMAN | 


]EST_HUMAN | 


TopHitAcession 


|AU1 18985.1 1 


1 110247111 


1 
1 


IAF231920.1 1 


|AF231 920.1 1 


IAJ237589.1 | 


IAJ237589.1 | 


IAF264750.1 | 


IAF264750.1 | 


1 


|AF096154.1 1 


|AF096154.1 1 


|BE379884.1 1 


11420514 


1 6005720] 


|AB020710.1 ] 


IAB020710.1 ] 


AF1 67340.1 1 


]AB01 4533.1 1 


1 11426758 


1 11422086] 


i 
< 


1 


1 11422109] 


i 

i 

< 


IAB002059.1 ] 


]D1 2234.1 1 


i 
i 




IAA702794.1 ] 


]AU1 43539.1 ] 


Most Similar 
(Top) Hit 
BLAST E 
Value 


1 2.0E-90| 


1 2.0E-90I 






s 


1 .OE-90 1 


s 


l.0E-90| 




s 


S 


l.0E-90| 






.0E-90| 


1 .OE-90] 


? 


05-30-1 


.OE-90] 


1 .OE-90 


] 05-30-1 


.OE-90] 




1 .OE-90 




1 .OE-90] 








1 5,0E-91| 


] 5.0E-91 ] 
























































Signal 










1 1-04| 










2.45 






















1 3.78] 




1 1.72] 








1 


» 








ORFSEQ 
ID NO; 


1 






1 20148| 


1 20148| 


1 20435] 


1 20436j 


1 204721 


1 




1 210441 


1 210451 




21631] 








24005 


25464] 




1 27189] 




1 


^ 










g 




] 24092] 


Exon 
SEQID 
NO: 


17572 


17914 


10241 


i 




10613 


10613 


1 


i 








1 






i 




1 


15401 










17240 


1 


i 


14001 




16714 


13346 


14309 
































































Probe 
SEQID 
NO: 


7722 












i 






i 


i 




1 


1 


2823 


1 37771 


3777 


432m 


5481 


1 6521 1 


7121 


1 73561 




7371 


9732 


9732 




1428 


i 


3429 


5 
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Top Hit Descriptor 


;AU143539Y79AA1Homosapien£cDNAoloneY79AA1002087 5' | 






tAV649878 GLC Homo sapiens cDNA clone GLCBYF08 3' | 


|AV649878 GLC Homo sapiens cDNA clone GLCBYF08 3' | 






Homo sapiens cliromosome 21 segment HS21 C084 | 


EST01579 Hippocampus, Stratagene (cat. #936205) Homo sapiens cDNA clone HHGMC60 similarto 1 
1 Retrovirus-related gag polyprotein | 


EST01579 Hippocampus, Stratagene (cat. S936205) Homo sapiens cDNA clone HHCMC60 similar to 1 

Retrovirus-related gag polyprotein | 


Homo sapiens solute carrier family 4, anion exclianger, member 3 (SLC4A3), mRNA | 


Homo sapiens solute carrier family 4, anion exclianger, member 3 (SLC4A3), mRNA | 




Homo sapiens beta-ureidopropionaEe(BUHI) gene, exon 6 | 


Homo sapiens chromosome 21 segment HS21 C083 | 


|Homo sapiens mRNA for K1AA1278 protein, partial cds | 


Homo sapiens mRNA for KIAA1278 protein, partial cds | 


IHomo sapiens cyclin-D binding Myb-like protein mRNA, complete cds f 


iHuman Ku (p70/p80) subunit mRNA, complete cds | 


Homo sapiens chromosome 21 segment HS21C085 [ 


[Homo sapiens chromosome 21 segment HS21 C085 \ 


iHomo sapiens epididymal secretory protein (19.5kD) (HE1). mRNA I 


Homo sapiens cyclin-dependent kinase 6 (CDK6) mRNA [ 








[Human L-type calcium channel beta-1 subunit (CACNLB1) gene, exons 10 and 1 1 |; 


I Human mRNA for very low density lipoprotein receptor, complete cds |, 


|genes, complete cds f 




Database 


,EST HUMAN | 






|EST HUMAN- | 


1 
§ 
^1 








EST HUMAN 


i 
5 

3 
I 


















z 












z 






z 




1 Ton Hit ArACAion l 




iAU143539.1 1 




i 


1 


1 


|AF1 56776.1 | 


< 


|AL1 63284.2 | 




i 


1 114301931 




|AF1 69555.1 | 


|AF1S9555.1 1 


IAL163283.2 | 


|AB033104.1 1 


< 


|AF0a4530.1 1 


|M30938.1 1 


|AL1 63285.2 | 


< 


1 114349641 


1 




1 1149761l| 






i 

5 


1 
< 


Most Similar 
(Top) Hit 
BU\STE 
Value 


5.0E-91I 


5.0E-91I 


5.0E-91I 


5.0E-91I 


3 


4.0E.91| 


s 


4.0E-91 1 


4.0E-91 


k 


3.0E-9l| 


<? 


3.0E-91I 




3.0E-91I 






3.0E-91 1 






3.0E-91I 


3.0E-91I 




s 


S 


3.0E-91I 


3.0E-91I 


S 


s 


1 


Signal 


















& 








1 


1 
































D 

c 


ID NO: 


§ 


1 243791 


1 24380 1 


1 27157| 


1 27158| 


1 228951 


i 




25314 


§ 




1 213671 






1 23020 1 


23136) 








8 


1 24561 1 


1 25470 1 




i 


1 


1 265351 


1 265361 


1 271631 


25286| 


1 Exon 


SEQ ID 
NO: 








1 


1 


13091 


13091 


s 


i 


§ 


1 


1 


1 


12492 


13210 


13333 




8 


i 


S 


14785 


15407 


15712 




15872 




i 




I9IO4I 


























































Probe 


SEQ ID 
NO: 


1 441 5| 


1 4703 


4703 


1 7087| 


1 7087j 


1 




1 83011 






1 


1 


i 


i 


1 3297| 


1 3416 


3416 


1 


1 4487, 


1 49051 


i 


1 


1 


1 


§ 


1 6502 


1 65021 


7093 


1 
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|3 



III 

I 



410 




411 



wo 01/57274 PCT/USOl/00666 
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III 



1 X |_ t 

8 tS^ 
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Top Hit Descriptor 


Homo sapiens tensin mRNA, complete cds | 


"8 

< 

E 
& 
O 

5 

} 

a: 

1 
o 


SO1 1 1 7586F1 NIH MGC 1 S Homo sapiens oDNA clone IMAGE:3358220 S \ 


S01116810F1 NIH_MGC_16 Homo sapiens oDNA clone IMAGE:3357243 5' | 




Homo sapiens hypothetical protein (LOC51 31 8), mRNA | 


UI-HF-BNO-aks-g-09-O-Ul.n NIH MGC 50 Homo sapiens cDNA clone IMAGE:3078329 5' | 


qp78b10.x1 Scares fetal lung NbHLISW Homo sapiens oDNA clone IMAGE:1929115 3' | 


zl29c1D.s1 Scares _pregnant_uterus__NbHPU Homo sapiens cDNA clone IMAGE:503346 3' 1 


Homo sapiens CYP17 gene, 5' end 1 


S01458531F1 NIH MGC SSHomooapionsoDNAolonolMAGE:38620865' | 


Homo sapiens CTR1 pseudogene 1 


Homo sapiens CTR1 pseudogene | 


Homo sapiens hypothetical protein (DJ328E19.C1.1), mRNA | 


oy84b08.x1 NCI CGAP CLL1 Homo sapiens cDN A clone IMAGE: 1672503 3' Similar to TR;Qe2384 0623841 
ZINC FINGER PROTEIN. ; 


Homo sapiens DNA for amyloid precursor protein, complete cds | 




Homo sapiens hypotheScal protein FLJ20291 (FU20291 ). mRNA 1 


Homo sapiens mRNA for KIAA1563 protein, partial cds [ 






Homo sapiens MHC class 1 region ( 


Novel human gene mapping to chomosome 1 [ 


601 1 77S86F1 NiH MGC 1 7 Homo sapiens cDNA clone IMAGE:3532965 5' ( 


1 
CD 

i 

1 
< 

■1 
i 

o 

C5 
5 

i 


Homo sapiens DNA for amyloid precursor protein, complete cds J 


Homo sapiens glucocorticoid receptor (GRL) gene, Intron D, exon 5, and intron E 


Homo sapiens glucocorticoid receptor (GRL) gene, intron D, exon 5, and intron E | 




Homo sapiens protein kinase C, beta 1 (PRKCB1 ), mRNA I 


Top Hit 
Database 


z 


z 


EST HUMAN | 


EST HUMAN ] 


EST HUMAN i 


Z 


EST HUMAN j 


EST HUMAN | 


EST_HUMAN 1 




EST HUMAN 


Z 




Z 


EST HUMAN 








Z 


Z 




z 




EST HUMAN | 


i 

=) 

X 


Z 2 


E z 


z 




z 


Top Hit Acession 
No. 


AF225896.1 | 


U40733.1 1 




BE253201.1 1 


AW964385.1 1 


1 


AW 502002.1 1 


AI312C25.1 1 


AA1 26735.1 1 


L41 825.1 1 


BF035327.1 | 


AF238997.1 1 


AF238997.1 | 


76570161 


AI146755.1 


D87675.1 1 


i 


e 
1 


AB04S783.1 | 


AF1 67705.1 1 


AF231 981.1 


i 

< 


< 


BE2973S9.1 | 


i 


D87675.1 1 


ii 


U78509.1 1 


1 


§ 


Most Similar 
(Top) Hit 

Value 


2.0E-93 


2.0E-93 


3 


2.0E-93 


S 


2.0E-93 


2.0E-93 


2.0E-93I 




2.0E-93I 


2.0E-93 










.OE-93 1 


.OE-93 1 


.OE-93, 


.0E-93| 






.OE-93 1 










.OE-93 


.OE-93 1 


1 .OE-93 


.OE-93 1 








































Expression 
Signal 
















































^ 


i 




^2 








ORF SEQ 

ID NO: 




i 


i 


2473^ 


i 


254421 




8 








1 


1 






20631 1 


209741 




21 083 1 


21085 


22070 


22190 




21030| 




II 


11 








Exon 
SEQ ID 
NO: 


1 


19731 




1 


5245 




5918 




S 


96061 


1 






1 






s 




1228 


i 


12172 




2336 






11 






































































Probe 
SEQ ID 
NO: 


1 


2083 


i 


§ 


i 


5462 


6014 


i 


1 


1 


97241 






s 










1321| 


i 






i 


2792 


2792 




5417 


5417 


1 5607I 


6067 



414 
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Top Hit Descriptor 


Human mRNA for NF1 N-isoform-exon1 1 , complete cds 1 


Homo sapiens nnRNA for KIAA1 41 1 protein, partial cds | 


H .sapiens mRNA for MEMO protein 1 


Homo sapiens protein l<inase inhibitor gamma (PKIG) mRNA, complete cds 1 


IHomo sapens mRNA for KIAA1485 protein, partial cds j 




Human PreA4 gene for Alzheimer's disease A4 amyloid protein precursor (exon 9) | 


Human PreA4 gene for Alzheimer's disease A4 amyloid protein precursor (exon 9) 1 


|qm03c12.x1 NCI CGAP Lu5 Homo sapiens cDNA clone IMAGE:18807583' similar to WP:ri9B4.4 | 
!CE13742 ; 


Homo sapiens glutathione S-transferase theta 2 (GSTT2), mRNA 1 


Homo sapiens chromosome 21 segment HS21C009 | 


Homo sapiens transcription enhancer factor-5 mRNA, complete cds 1 


1 Homo sapiens mitogen-actlvated protein kinase 1 2 (MAPK1 2). mRNA 1 


Homo sapiens mRNA for KIAA0612 protein, partial cds 1 


1 Homo sapiens mRNA for KiAA0612 protein, partial cds 1 


|zg87g06.s1 Scares fetal heart NbHH19W Homo sapiens cDNA clone IMAGE:409594 3' 


|ot83d05.s1 Soares total fetus Nb2HF8_9w Homo sapiens cDNA clone IMAGE:1623369 3' 


|yd98b04,El Soares fetal liver spleen 1NFLS Homo sapiens cUNA clone IMALib:1 16239 3' 1 


1 Homo sapiens mRNA for KIAA0027 protein, partial cds 1 






[Homo sapiens protein phosphatase 1 , regulatory subunlt 10 (PPP1 RIO) mRNA 1 


tw11f10.x1 NCLCGAP_Brn52 Homo sapiens cDNA clone IMAGE:2259403 3' similar to TR:Q15265 Q1 5265 
1 PROTEIN TYROSINE PHOSPHATASE ; 


1 Homo sapiens solute carrier family 22 (organic cation transporter), member 1-iike (SLC22A1 L), mRNA | 


Inomo sapiens solute carrier family 22 (organic cation transporter), member 1-iike (SLC22A1L), mRNA 


< 

!l 

S ^ 
5 £ 

5 i 

Ii 

II 
^ « 

1 i 

.1 \ 

i: 


1 Homo sapiens complement component 5 (05) mRNA 




1 Homo sapiens oysteine-rich repeat-containing protein S52 precursor, mKNA, complete cds | 


Top Hit 
Database 


NT 1 


NT 


NT 1 


NT 1 


NT 


NT 1 


NT 1 


NT 


EST HUMAN 1 




NT 


,NT 1 


NT . 1 


NT 


NT 


lEST HUMAN 


lEST HUMAN 


lEST HUMAN! 


iNT 


NT 


iNT 


NT 


EST human! 


NT 


L 


hi — 





NT 


|NT 


Top Hit Acession 
No. 


D42072.1 1 


1 


Y1 0183.1 1 


AF182032.1 1 


1 
1 


AF091 395.1 


XI 3474.1 1 


XI 3474.1 1 




114178561 


i 

5 


,AF1 42482.1 1 


114183511 


IAB014512.1 1 


IAB014512.1 1 


IAA722434.1 1 


IAI015800.1 1 




ID25217.2 1 




IL05094.1 1 




AI591312.1 


1 


11440670 


1 


1 4502506 


IAFI 67706.1 


|AF1 67706.1 


ilil 


1 .OE-93 


3 


3 


1 .OE-93 


1 .OE-93 


1 .OE-93 


1 .OE-93 


1 .OE-93 




1 .OE-93 


3.0E-94 


3.0E-94 


3.0E-94 


5.0E-94 


3 


5.0E-94 


3 


5.0E-94 




3 


3 


4.0E-94 


4.0E-94 


4.0E-94 


4.0E-94 


4.0E-94 


3.0E-94 


3 




otcD 




























































Expression 
Signal 


4.09 1 


s 










4.54 


4.54 
















1.72 












i 
















ORFSEQ 
ID NO: 


26320 


i 












27590 


25132 






23579 










1 


24911 








22379 










I 1 


21477 




c S .. 




1 




i 


6450 




7380 






r9?.57 


I7R75 


13791 


i 


15217 






1 




r9378 


i 




1 




15798 


§ 




10637 


11607 
































































Probe 
SEQID 
NO: 


i 








r^7437l 












g 








1 




6183 


i 


1 




§ 


2621 


1 








i 1 




i 



415 
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Top Hit Descriptor 


Homo sapiens mRNA for KIAA1395 protein, partial cds 




zu84b01 .si Soar6S_tesfis_NHT Homo sapiens cDNA clone IMAGE: 744649 3' similar to contains LI .t1 L1 1 
1 repetitive element ; | 


Homo sapiens DNA for amyloid precursor protein, complete cds 1 


Homo sapiens DNA for amyloid precursor protein, complete cds 1 


Homo sapiens Ly-6-like protein (CD59) mRNA, complete cds | 


Homo sapiens ciiromosome 21 segment HS21 C046 


Human homeobox protein (PH0X1) mRNA, 3' end | 


602071 146F1 NCI OGAP Brn64 Homo sapiens cDNA clone 1MAGE:4214147 5' | 


Homo sapiens H factor 1 (complement) (HF1 ) mRNA 1 


Homo sapiens K1AA0255 gene product (K1AA0255). mRNA 1 


Homo sapiens KIAA0255 gene product (K1AA0255), mRNA 1 


mRNA 1 


601312161F1 NIH_MGC_44 Homo sapiens cDNA clone IMAGE:3558862 5' j 




Homo sapiens G protein-coupled receptor 1 9 (GPR1 9) mRNA | 


Homo sapiens glutathione S-transferase theta 2 (GSTT2) and glutathione S-transferase theta 1 (GSTT1 ) 1 
genes, complete cds |' 


Homo sapiens glycine cleavage system protein H (aminomethyl carrier) (GCSH) mRNA H 


Homo sapiens H factor 1 (complement) (HF1 ) mRNA |. 




Homo sapiens unconventional myosin-1 5 (L0G51 1 68), mRNA ], 


Homo sapiens unconventional myosin-1 5 (L0C51 1 68), mRNA \: 


1 
1 

i 

i 

£ 

01 

E 

1 
1 


qm01c02.x1 Scares NhHMPu S1 Homo sapiens cDNA clone IMAGE:1880546 3' similar to WP:T23G7.4 1 
CE03705 ; |' 


Homo sapiens hypothetical protein (HS322B1A), mRNA li 


Homo sapiens KIAAC187 gene product (KIAA0187), mRNA | 


i 

1 
i 

Q 

.1 
t 

X 

i 
1 

1' 

I 


i 

1 
1 

Q 

i 
1 

i 

I 

i 
i 

s 

i: 


Homo sapiens CGl-48 protein (LOC51096), mRNA |< 


Homo s^iens CGl-48 protein (LOG51096), mRNA |i 


Top Hit 
Database 
Source 


NT 


NT 


EST HUMAN 


NT 


NT 


NT 


NT 




EST HUMAN | 


NT 1 


INl 


NT 1 




EST HUMAN 


NT 1 


NT 1 


NT 1 


NT _ 


NT ] 


z 


NT 


NT ] 


z 


EST HUMAN | 


NT 


NT ] 


EST HUMAN ] 


EST HUMAN ] 


NT ] 


NT ] 


TopHitAcession 
No. 


1 


AF1 121 52.1 


AA629056.1 


D87675.1 1 


D87675.1 1 


M95708.1 1 


AL1 63246.2 | 


M95929.1 1 


BF526041.1 1 


45043741 


1 




4507512I 


BE393873.1 ] 




5453665] 


AF240786.1 | 


4758423] 


4504374] 


AFOI 5452.1 1 






AB037807.1 | 


AI2g0264.1 1 


7657185] 


i 


AA447931.1 | 


i 
1 


7705764] 


7705764] 


Most Similar 
(Top) Hit 

Value 


8.0E-95j 


8.0E-95I 


8.0E-95 


7.0E-95| 


7.0E-95I 


7.0E-95I 


7.0E-95! 


7.0E-95I 


3.0E^5| 


2.0E-95| 


2-0E-95| 


2.0E-95| 


2.OE-95I 


2.0E-95I 




2.0E-95] 


2.OE-95I 


2.0E-95] 


2.0E-95] 


1 


2.0E-95] 


2.OE-95] 


2.0E-951 


2.OE-95I 


1 


2.0E-95I 


2.0E-95I 




2.0E-95I 


S 

i 


Expression 
Signal 


2.83, 


2.411 




1 


1 


i 


^ 


8 




« 






rJ 










% 


i 




2.78] 


2.78] 




^ 




2.571 


1 


1 


^ 


3.59] 


ORFSEQ 
ID NO; 












239471 




24623 1 


250871 


i 


21387] 


21388] 


2I672I 


21676] 


22147! 






22235] 




1 


23232] 


23233] 




2339o| 


23940] 


24615] 


24663] 


1 


251211 


25122] 


Exon 
SEQ ID 
NO: 


7531 










i 




48571 


5261 


i 




i 




E 


2256 


2256 




2342 


i 






3433 




3604 


4163 








5287 


5287| 






























































Probe 
SEQID 
NO: 


i 


i 


1 






4270 


9 


1 


1 




1625 


1 


1 


1900 


23/6 


2376 




1 


2787 


3120 


3517 


3517 


3565 


1 


4264 


? 


Si 

s 






53671 



417 
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Top Hit Descriptor 


yr87h12.r1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone IMAGE:212327 5' | 


Homo sapiens chondroitin sulfate proteoglycan 4 (melanoma-associated) (CSPG4), mRNA | 




RC3-HT0230-040500-1 1 0-g02 HT0230 Homo sapiens cDNA | 


AV689461 GKC Homo sapiens cDNA clone GKCFMD07 5' | 


2819351 Sprime NIH_MGG_7 Homo sapiens cDNA clone IMAGE:2819351 5' | 


Human endogenous retrovirus type K (HERV K), gag, pol and env genes 1 


EST3671 24 MAGE resequences, MAGC Homo sapiens cDNA | 


EST367124 MAGE resequences, MAGC Homo sapiens cDNA | 




Homo sapiens flavin containing monooxygenase 2 (FM02) mRNA | 


Human hepatocyte growth factor gene, exon 1 | 


Human hepatocyte growth factor gene, exon 1 | 


Felis catus superfast myosin heavy chain (sMyHC) mRNA, complete cds | 


mRNA 1 




Homo sapiens mRNA for K1AA1290 protein, partial cds | 


Homo sapiens mRNA for KIAA1 290 protein, partial cds | 


601863712F1 NIH_MGC_57 Homo sapiens cDNA clone IMAGE:4081202 5' |i 


1L5-HT011 7-011 099-004-D07 HT0117 Homo sapiens cDNA |< 


DKFZp434N0323 r1 434 (synonym: htes3) Homo sapiens cDNA clone DKFZp434N0323 5' |' 


zw97e12.s1 Soares_NhHMPu_S1 Homo sapiens cDNA clone iMAGE:767758 3' similar to TR:G1304125 | 
G1304125 PMS4 MRNA ; 1 


Q 

! 

i 

X 

1 

9 
1 


MR0-HT0241-150500-010-b02 HT0241 Homo sapiens cDNA | 


MRO-HT0241-150500-010-b02HT0241 Homo sapiens cDNA |' 


CM0-BN01 03-1 70300-293-a06 BN0106 Homo sapiens cDNA |l 


Homo sapiens brefeldin A-inhibited guanine nucleotide-exchange protein 2 (BIG2), mRNA | 




Homo sapiens v-src avian sarcoma (Schmidt-Ruppin A-2) viral oncogene homolog (SRC), mRNA ■ 1- 


Homo sapiens mRNA for KIAA0594 protein, partial cds | 


Top Hit 
Database 
Source 


EST HUMAN | 


NT 1 


NT 1 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN 1 


NT 1 


EST HUMAN | 


EST HUMAN | 


NT 


NT 1 


NT 1 


NT 1 


NT 1 




NT 


NT 


NT 1 


EST HUMAN 1 


EST HUMAN 1 


EST HUMAN | 


EST HUMAN 


EST HUMAN 


EST HUMAN 


EST HUMAN 


EST HUMAN 


NT 


NT 


NT 


NT 


Top HitAcession 
No. 


H68a56.1 1 


§ 


AL1 63248.2 1 


BE1 48074.1 1 




AW249440.1 1 


1 


AW955054.1 | 


AW955054.1 | 


i 


45037531 


5 


M75967.1 1 




II419429I 


1 


AB033116.1 1 


AB033116.1 1 


BF245240.1 


i 


AL043314.2 | 


AA418026.1 1 


BF1 5491 2.1 1 


BE148597.1 | 


BE148597.1 1 


BE004436.1 1 


54535721 


V1133,, 


11421793 


i 
1 


Most Similar 
(Top) Hit 
BLAST E 
Value 


3.0E-95I 




2.0E-96| 


2.0E-96I 


2.0E-96I 


2.0E-96 




1.0E-96I 


S 


1.0E-95I 


1.0E-96I 


3 


S 


3 


3 


3 


1.0E-96I 


3 


6.0E-97 


3 


5.0E-97I 


5.OE-97I 


5.0E-97I 




5.0E-97I 


4.0E-97I 


4.0E-97I 


4.0E-97 


4.0E-97 


4.0E-97I 


Signal 


6.22 








80-9 




§ 


2.031 


2.03 1 






1.33 










! 


! 


i 




1.76 


^ 


2.76 


«! 


» 




P 






1.17| 


ORFSEQ 
ID NO: 






1 


243261 






i 




21511 




21581 


i 


21970 




27125 


i 


27938 


8 


1 




26747 


268O4I 


276431 




i 




21638 


26167 


s § 




Exon 
SEQ ID 

NO: 




s 


g 


4537 


§ 




i 




1643 










g 


1 




1 


i 


3206 




1 


i 




1 








6027 


































































Probe 
SEQID 

NO: 


1 


1 




i 


7191 


9151 


i 


1742 


1742 


1 


1806 


2181 


2181 


2219 


7058 


713 




7843 


1 


i 




1 


7578 




8832 


i 


1 


i 


i i 


7328 
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III 



1 I 1- ^ 
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Top Hit Descriptor 


Homo sapiens IL2-inducible T-cell kinase (ITK), mRNA | 


1 Homo sapiens PMS2L1 6 mRNA, partial cds | 


1 Homo sapiens PMS2L16 mRNA, partial cds 1 




Homo sapiens cliromosome 21 segment HS21 C001 ] 


1 ht58f02.x1 NCLCGAP_Lu24 Homo sapiens cDN A clone IMAGE:31 51 899 3' 1 


1 AJ4031 24 3.4 (downregulated m larynx carcinoma) Homo sapiens cDNA clone 18 | 


IHomo sapiens mRNA for KIAA0707 protein, partial cds | 


i 7B' 8H01 Chromosome 7 Fetal Brain cDNA Library Homo sapiens cDNA clone 7B1 8H01 1 


i Homo sapiens activator of S phase kinase (ASK), mRNA 1 


: Homo sapiens activator of S phase kinase (ASK), mRNA 1 


yc17g09.r1 Scares adult brain N2b5HB55Y Homo sapiens cDNA clone IMAGE:178240 5' 1 


AJ4031 24 3 4 (downregulated in larynx carcinoma) Homo sapiens cDNA clone 18 1 






Homo sapiens Ran GTPase activating protein 1 (RANGAP1 ), mRNA 1 


601172658F1 NIH_MGC_17 Homo sapiens cDNA clone IMAGE:3528134 5' 1 


Homo sapiens chromosome 21 segment HS21 C002 1 


Homo sapiens potassium channel subunit (HERG-3) mRNA, complete cds | 


Homo sapiens fatty-acid-Coenzyme A ligase, long-chain 4 (FACL4) mRNA |i 


Homo sapiens attractin precursor (ATRN) gene, exon 1 6 


Homo sapiens attractin precursor (ATRN) gene, exon 16 1' 




Homo sapiens protein ti^osine kinase 2 beta (PTK2B) mRNA 




Homo sapiens SH3-domain GRB2-like 2 (SH3GL2), mRNA 1 


< 

E 

CO 

1 

a 
1 

I 


H.sapiens arginase gene exon 3 (EC 3.5.3.1) | 


Homo sapiens AIM-1 protein (L0C51 1 51 ), mRNA 1 




tv3Sb04.x1 NGI_CGAP_Ut1 Homo sapiens cDNA clone IMAGE:2261 743 3' similar to SW:RL2B_HUMAN I; 
P2931 6 60S RIBOSOMAL PROTEIN L23A. ; |; 


Top Hit 


Source 












|EST HUMAN | 


|EST HUMAN | 




|EST HUMAN | 






|EST HUMAN | 


EST HUMAN | 


EST HUMAN | 


z 


z 


|EST HUMAN | 


z 


z 


z 


z 


z : 


I z 


z 














EST_HUKMN | 


TopHitAcession 
No. 




|ABC1 7007.1 1 


lABCI 7007.1 1 


IJ04469.1 1 


IAL163201.2 1 


IBE348727.1 | 


i 
< 


1 
< 


i 
1 


i 






IAJ403124.1 1 


AJ403124.1 1 


U59309.1 




BE294281.1 | 


1 


AF032897.1 | 




AF21 8902.1 1 


AF21 8902.1 1 


§ § 
I 8 


47589751 


77065121 


114288131 


114288131 


XI 2664.1 1 




114359471 


AI862007.1 1 


IVIost Similar 
(Tod) Hit 


BLAST E 
Value 




1 8.0E-98I 


s 


1 8.0E-98I 


3 


1 4.0E-98I 


s 


1 3.0E.98I 


3 


1 3.0E-98| 


1 3.0E-98I 


j 3.0E-98I 


3 


3.CE-98I 


3 


s 


s 


3 


S 


s 


3 


S I 


33 


3 


3 


2.0E-98| 


2.0E-98I 




2.0E-98| 


2.0E-98| 


3 


1 

1 


Signal 


§ 


1 0.98 1 


0.98 


1 5.03 1 
















1 3.31 1 










Q 




1 








1 1 


q 












1.43 




ORFSEQ 


ID NO: 




1 21 468 1 




1 23424 1 




1 252201 








t 261591 


§ 


271 53 1 


s 


s 










§ 




243901 


1 


24712 




250281 


270731 


27074 1 






t 25301 1 




Exon 
SEQ ID 
- NO: 


11444 


11597 


s 


i 


14921 






i 




8 


1 


8 
8 


17535 


17535 


i 




11924 




1 




S 


if 


I ? 


i 




§ 








i 
































































Probe 
SEQ ID 
NO: 


1540 


1 


i 


1 3726| 


1 


9717 




2565 




6147 


1 61471 


1 70831 


1 


7685 




i 


2033 


2191 


3 


4244 


1 47201 




; s 




1 53031 


7004 


7004 


1 


7951 


9350 


i 
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Top Hit Descriptor 


|S01299982F1 N1H_MGC_21 Homo sapiens cDNA clone IMAGE:3629901 5' 1 


1 ameOol 0.x1 Johnston frontal cortex Homo sapiens oDNA clone 1MAGE:1 539954 3' similar to 1 
|SW:GG95_HUMAN Q08379 GOLGIN.95. ; | 


|am60c10.x1 Johnston frontal cortex Homo sapiens cDNA cbne IMAGE:1 539954 3' similar to 1 
|SW:GG95 HUMAN Q08379 GOLGIN-95. ; | 


|HomosapiensKIAA0187geneproduct(KiAA0187), mRNA 1 


1 
1 

a 

t 
i 

< 


|AU141005 PLACE4 Homo sapiens cDNA clone PLAOE4000650 5' | 




1 601 1 07843F1 NIH_MGCJ 6 Homo sapiens cDNA clone IMAGE:3343882 5' | 


|yi32c04.r1 Soares placenta Nb2HP Homo sapiens cDNA clone IMAGE:140934 5' | 


Homo sapien s protein phosphatase-1 regulatory subunit 7 (PPP1 R7) gene, exon 7 1 


[Homo sapiens HSC54 mRNA for heat shock cognate protein 54, complete cds 1 






|ar82fOax1 Barstead colon HPLRB7 Homo sapiens cDNA clone IMAGE:21 51 785 3' similar to TR:Q13137 1 
Q13137NDP52.; | 


1 Homo sapiens mRNA for Centaurin-alpha2 protein | 


IAV710738 Cu Homo sapiens cDNA clone CuAAKD03 5' | 


1 QV3-NT0025-21 0600-236-h08 NT0025 Homo sapiens cDNA | 


|601 501 107F1 NIH_MGC^70 Homo sapiens cDNA clone IMAGE:39D3145 5' |1 


|AV69481 7 GKC Homo sapiens cDNA clone GKCEEE1 1 5' |< 


AV69481 7 GKC Homo sapiens cDNA clone GKCEEE1 1 5' 


1 Homo sapiens mRNA for K1AA0454 protein, partial cds t 


|yd13d07.r1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:67021 5' ^ 


3d07.r1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone IMAGE:67021 5' \ 


AU124629 NT2RM4 Homo sapiens cDNA clone NT2RM4000309 5' 




|Homo sapiens mvomssin (M-protein) 2 (165kD) (MY0M2), mRNA |i 


IRC-BT074-260499-014 BT074 Homo sapiens cDNA | 


jRC-BT074-260499-014BT074 Homo sapiens cDNA |i 


on57h04,s1 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone IMAGE;1 560823 3' similar to li 
SW:CAV2_HUMAN P51636 CAVEOLlN-2. [1] ; \i 


|601439392F1 NIH_MGC_72 Homo sapiens cDNA clone IMAGE:3924166 5' |, 


Top Hit 
Database 


lEST HUMAN 




EST HUMAN 


z 




lEST HUMAN | 


z 


lEST HUMAN 1 




z 


z 


z 


Z 


EST HUMAN | 




|EST HUMAN | 




[EST HUMAN 1 




lEST HUMAN 1 




|EST_HUMAN 1 


lEST HUMAN | 


lEST HUMAN 






lEST HUMAN | 




EST HUMAN 


EST HUMAN | 


TopHitAcession 
No. 


|BE408447.1 1 


i 

< 


1 


1 


IAU141005.1 1 


|AU1 41 005.1 1 


i 






|AF067133.1 1 


|AB034951.1 1 


i 
g 


i 


AI459825.1 | 


i 


|AV71 0738.1 1 


IBE763051.1 1 


IBE910555.1 1 


IAV694817.1 1 


1 


IAB007923.1 | 


|T70393.1 1 


IT70393.1 1 


IAU124629.1 ■ 1 


114254301 


114254301 


IAI905037.1 1 


i 


i 

1 


i 

s 


Most Similar 
(Top)H-rt 
BLAST E 
Value 
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S 


S 
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s 
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1 


? 
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i 












P 
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ORFSEQ 
ID NO: 








1 22747| 


1 


1 


1 2381 4| 


i 


247051 


25025| 




25522| 


255231 


258181 


1 263761 




26857 






1 


27069] 


27448| 


1 


274901 


1 


28053 1 


i 




1 


1 


Exon 
SEQ ID 
NO: 






12145 


,2954 


13021 




14039 




14933 




15433 


15453 


15453 


15706 


s 
















17243 


17282 






17831 




































































Probe 
SEQ ID 
NO: 


1 


1 


2261 


1 


3094 


3094 






5063 


5300 


5515 




5536 


1 


i 


S 


1 


1 


!? 


1 


1 70011 


7374 


7374 






7961 


7981 


1 


§ 
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1 ^ !^ B 
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Top Hit Descriptor 


Homo sapiens KIAA0054 gene product; Helicase (KiAA0054), mRNA 1 


|oz31a06.x1 SoaresJotal_fetus_Nb2HF8_9w Homo sapiens cDNA clone IMAGE:1676914 3' | 


|oz31a06.x1 Scares total fetus Nb2HF8_9w Homo sapiens cDNA clone IMAGE:1676914 3' | 


Homo sapiens mRNA for KIAA0995 protein, partial cds 1 


RC6-ET0081-130700-011-G01 ET0081 Homo sapiens oDNA 1 




Homo sapiens eukarv^Bc transiation initiation factor 4B (EIF4B), mRNA 1 


IHuman mRNA for KIAA0338 gene, partial cds | 


Human mRNA for KIAA0338 gene, partial cds | 


'xx32f08.x1 NCI CGAP Uti Homo sapiens oDNA clone IMAGE:2839239 3' similar to SW:CAYP_CANFA 1 
P104e3 CALCYPHOSINE ; 1 


Homo sapiens calcium channel, voltage-dependent, alpha 1 E subunit (CACNA1 E) mRNA | 


|601 121347F1 NIH MGC 20 Homo sapiens cDNA clone IMAGE:2988875 5' | 


1 Homo sapiens syriaptojanin 1 (SYNJ1 ), mRNA | 


1 Homo sapiens synaptojanin 1 (SYNJI ), mRNA | 


Homo sapiens pericentrin (PCNT) mRNA | 


[Homo sapiens pericentrin (PCNT) mRNA | 


i 

1 - 
II 

|{ 

i! 
■11 
ll 

i, 
s ^ 

1 

5 . 


Human apolipoprotein B-100 (apoB) gene, exons 17 and 18 ] 


1 Homo sapiens protein phosphatase, EF hand calcium-binding domain 1 (PPEF1) mRNA | 


iHuman olfactory receptor olfrl 7-201-1 (OR1 7-201-1) gene, olfactory receptor olfrl 7-32 (0R1 7-32) gene and I 


Homo sapiens mRNA for KIAA0790 protein, partial cds 


1 Homo sapiens DiGeorge syndrome critical region, centromeric end [ 




1 Homo sapiens sodium phosphate transporter 3 (NPT3) mRNA [ 


1 PM-BT1 35-070499-01 6 BT135 Homo sapiens cDNA f 


1 Homo sapiens partial 5-HT4 receptor gene, exons 2 to 5 [ 


zc24d07.r1 Scares senescent fibroblasts NbHSF Homo sapiens cDNA clone IMAGE.323245 5' similar to 
SW:MDHM_MOUSE P08249 MALATE DEHYDROGENASE, MITOCHONDRIAL PRECURSOR ; 


1 Homo sapiens mRNA for KIAA1 636 protein, partial cds f 


Top Hit 


|l 




EST HUMAN | 






EST HUMAN | 


EST HUMAN | 


z 






EST HUMAN | 




lEST HUMAN 1 


H 




z 




EST HUMAN 1 








z 


; 

u z 






lEST HUMAN 




EST HUMAN 


z 


Top Hit Acession 


z 


i 


< 


i 


AB023212.1 1 


BE830187.1 1 


BE830187.1 1 


114347721 




IAB002336.1 1 


AW5715441 1 


45025281 


IBE275502.1 1 


1 


1 4507334] 


51744781 


1 


AU133030.1 1 


M19824,1 


1 54539411 


|u78308.1 1 


S £ 

ll 






1 5031954! 


IAI907096.1 


|AJ243213.1 1 


W42822.1 


i 
1 
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1 
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278161 
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25641 


1 25781 
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Top Hit Descriptor 


Homo sapiens BTRCP2 mRNA for F-box and WD-repeats protein isoform 0, complete cds 1 


IHomo sapiens ALR-like protein mRNA, partial cds | 


Homo sapiens ALR-like protein mRNA, partial cds | 


(OYPIBI)mRNA | 


(CYPIBI)mRNA | 


1 
1 

1 
1 

I 

1 

1 
± 




Homo sapiens mRNA for KIAA1 527 protein, partial cds | 


Homo sapiens partial rpl3 gene for ribosomal protein L3, U82 snoRNA, U83a snoRNA and U83b snoRNA | 


Homo sapiens chromosome 21 segment HS21 C001 | 


Human AHNAK nuclooprotein mRNA, 5' end | 


PM0-HT0343-281 299-003-e02 HT0343 Homo sapiens cDNA 1 


PM0-HT0343-281 299-003-e02 HT0343 Homo sapiens cDNA 1 


i 
1 

s 

1 
1 


Homo sapiens chromosome 21 segment HS21 C068 | 








Novel human mRNA from chromosome 1 , which has similarities to BAT2 genes ] 


H.sapiens mRNA for nuclear DNA helicase II -J 


Homo sapiens protccadherin alpha CI (PCDH-alpha-CI) mRNA, complete cds J 


Homo sapiens euicaryotic translation elongation factor 1 alpha 1 (EEF1 A1) mRNA =( 


Homo sapiens eul^aryotic translation elongation factor 1 alpha 1 (EEF1A1) mRNA 1j 


< 

E 
1 

1 
1 

i 
1 

! 
f 
1 


DKFZp586G0621_r1 586 (synonym: hutel ) Homo sapiens cDNA clone DKFZp586G0621 | 


a: 

E 
1 

! 

t 

1 
1 

1 
f 






QV2-BT0635-130400-138-h03 BT0636 Homo sapiens cDNA 


|QV2-BT0636-130400-138-h03 BT0636 Homo sapiens cDNA j 


1 
! 

1 

1 

I 


Top Hit 
Database 
Source 


iNT 1 


W 1 


iNT 1 


NT 1 




iNT 1 


INT 1 


NT_ 




NT 


iNT 1 


|EST HUMAN 1 


[EST HUMAN 1 


iNT 1 


iNT . 1 


NT 


iNT 1 


iNT 1 


NT 


iNT 1 


NT 


NT 


iNT 1 


iNT 1 


|EST HUMAN | 


NT 


iNT 1 




1 


]EST HUMAN ] 


NT 1 


Top Hit Acession 
No. 


1 
§ 


i 

i 

< 


IAF264750.1 | 


1 4503202I 


i 
1 




1 
< 




|aJ238852.1 | 


|AL163201.2 1 


IW80902.1 1 


|BE1 54504.1 1 


1 


1X73428.1 1 


IAL163268.2 | 


1 


ID50657.1 1 


s 
s 

Q 


IAL096857.1 | 


IY10658.1 1 


|AF1 52303.1 1 
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CLAIMS 



1. A spatially-addressable set of single exon nucleic -acid 
probes for measuring gene expression in a sample derived 
5 from human heart comprising a plurality single exon nucleic 
probes, said probes comprising any one of the nucleotide 
sequences set out in SEQ ID NOs : 1 - 9,980 or a 
complementary sequence, or a portion of such a sequence. 

10 2. A spatially-addressable set of single exon nucleic acid 
. probes as claimed in claim 1 wherein each of said plurality 
of probes is separately and addressably amplifiable. 

3. A spatially-addressable set of single exon nucleic acid 
15 probes as claimed in claim 1 wherein each of said plurality 

of probes is separately and addressably isolatable from 
said plurality. 

4. A spatially-addressable set of single exon nucleic acid 
20 probes as claimed in any of claims 1 to 3 wherein said 

probes comprise any one of the nucleotide sequences set out 
in SEQ ID NOS.: 9,981 - 19,771. 

5. A spatially-addressable set of single exon nucleic acid 
25 probes as claimed in any of claims 1 to 4, wherein each of 

said plurality of probes is amplifiable using at least one 
common primer. 

6. A spatially-addressable set of single exon nucleic acid 
30 probes as claimed in any of claims 1 to 5 wherein the set 

comprises between 50 - 20,000 single exon nucleic acid 
probes . 

7. A spatially-addressable set of single exon nucleic acid 
35 probes as claimed in any of claims 1 to 6, wherein the 
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average length of the single exon nucleic acid probes is 
between 200 and 500 bp. 

8. A spatially-addressable set of single exon nucleic acid 

5 probes as claimed in any of claims 1 to 7, wherein at least 
50% of said single exon nucleic acid probes lack 
prokaryotic and bacteriophage vector sequence. 

9. A spatially-addressable set of single exon nucleic acid 
10 probes as claimed in any of claims 1 to 8, wherein at least 

50% of said single exon nucleic acid probes lack 
homopolymeric stretches of A or T. 

10. A spatially-addressable set of single exon nucleic acid 
15 probes as claimed in any of claims 1-9 characterised in 

that said set of probes is addressably disposed upon a 

substrate. 

11. A spatially-addressable set of single exon nucleic acid 
20 probes as claimed in claim 10 wherein said substrate is 

selected from glass, amorphous silicon, crystalline silicon 
and plastic. 

12. A microarray comprising a spatially addressable set of 
25 single exon nucleic acid probes as claimed in any of claims 

1 - 11. 

13. A single exon nucleic acid probe for measuring human 
gene expression in a sample derived from human heart 

30 comprising a nucleotide sequence as set out in any of SEQ 
ID NOs.: 1 - 9,980 or a complementary sequence or a 
fragment thereof wherein said probe hybridizes at high 
stringency to a nucleic acid molecule expressed in the 
human heart . 
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14. A single exon nucleic acid probe as claimed in claim 13 
comprising a nucleotide sequence as set out in any of SEQ 
ID NOs . : 9,981 - 19,771 or a complementary sequence or a 
fragment thereof. 

5 

15. A single exon nucleic acid probe for measuring human 
gene expression in a sample derived from human heart which 
is a nucleic acid molecule having a sequence encoding a 
peptide comprising a peptide sequence as set out in any of 

10 SEQ ID NOs.: 19,772 - 29,119, or a complementary sequence 
or a fragment thereof wherein said probe hybridizes at high 
stringency to a nucleic acid expressed in the human heart. 



16. A single exon nucleic acid probe as claimed in any one 
15 of claims 13 to 15 wherein said single exon nucleic acid 
probe comprises between 15 and 25 contiguous nucleotides of 
said SEQ ID NO. 



17. A single exon nucleic acid probe as claimed in any one 
of claims 13 to 15, wherein said probe is between 3 - 25 kb 
in length. 

18. A single exon nucleic acid probe as claimed in any one 
of claims 13 - 17, wherein said probe is DNA, RNA or PNA. 

19. A single exon nucleic acid probe as claimed in any one 
of claims 13 - 18, wherein said probe is detectably 
labeled. 



30 20. A single exon nucleic acid probe as claimed in any one 
of claims 13 - 19, wherein said probe lacks prokaryotic and 
bacteriophage vector sequence. 

21. A single exon nucleic acid probe as claimed in any one 
35 of claims 13 - 20, wherein said probe lacks homopolymeric 
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stretches of A or T. 



22. A method of measuring gene expression in a sample 
derived from human heart, comprising: 

5 contacting the microarray of claim 12, with a first 

collection of detectably labeled nucleic acids, 
said first collection of nucleic acids derived 
from mRNA of human heart; and then 
measuring the label detectably bound to each probe of 
10 said microarray. 

23. A method of identifying axons in a eukaryotic genome, 

comprising: 

algorithmica].ly predicting at least one exon from 
15 genomic sequence of said eukaryote; and then 

detecting specific hybridization of detectably labeled 
nucleic acids to a single exon probe, 
wherein said detectably labeled nucleic acids are derived 
from mRNA from the heart of said eukaryote, said probe is a 
20 single exon probe having a fragment identical in sequence 
to, or complementary in sequence to, said predicted exon, 
said probe is included within a microarray according to 
claim 12, and said fragment is selectively hybridizable at 
high stringency. 

25 

24. A method of assigning exons to a single gene, 
comprising: 

identifying a plurality of exons from genomic 
sequence according to the method of claim 23; and 

30 then 

measuring the expression of each of said exons in a 
plurality of tissues and/or cell types using 
hybridization to single exon microarrays having a 
probe with said exon, 

35 wherein a common pattern of expression of said exons in 
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said plurality of tissues and/or cell types indicates that 
the exons should be assigned to a single gene. 

25. A nucleic acid sequence as set out in any of SEQ ID 
5 NOs : 1 - 19,771 which encodes a peptide. 

26. A peptide encoded by a sequence as set out in any of 
SEQ ID Nos: 1 - 19,771. 

10 27. A peptide comprising a sequence as set out in any of 
SEQ ID Nos: 19,772 - 29,119. 



519 



wo 01/57274 



PCT/USOl/00666 




GENOMIC SEQUENCE 
DATABASE 



IDENTIFY FUNCTiONAL SEQUENCES j 
WITHIN GENOMIC DATA 






IDENTIFY SUBSET 
FUNCTIONAL SEQUE 
FOR CHOSEN ASS 


OF 

NCES 
AY 






CREATE PHYSICAL AND/OR 
INFORMATIONAL SUBSTRATE AND 
PERFORM CHOSEN ASSAY 








ANNOTATE SEQUENCE DATA 



SEQUENCE 
ANNOTATION 
FROM EXTERNAL 
SOURCES 



/ 31SPLAY 
>■ ( ANNOTATED 
\SEQUENCE 



wo 01/57274 



PCT/USOl/00666 



2/1 0 




QUERY GENOMIC 
SEQUENCE DATABASE 




PREPROCESS SEQUENCE 
FOR DESIRED APPROACH 
AND METHOD 




wo 01/57274 PCT/USOl/00666 



] □ □ □ 0 

^g-j-i, 

ID D □ □ Q □ 



Fig. 3 



wo 01/57274 



PCT/USOl/00666 



4/1 0 



Grail 
Genefinder 
Diction 
EST hit 
Qiip seq. 

Expression^nd 



m 1 

«4 1 

m 








Base number 



■^1 



Fig. 4 



wo 01/57274 



PCT/USOl/00666 



5/1 0 



5000 



4000 



3000 



LU 

o 2000 



1000 




15 30 45 60 75 90 10 12 13 15 16 18 19 21 22 24 
50 00 50 00 50 00 50 00 50 00 



Length 



Fig. 5 



wo 01/57274 PCT/USOl/00666 



1000 



800 - 



§. 600 - 
o 



X2 

z 200 - 



0 




0 1 2345678 9 10 



Number of Tissues 



Fig. 6 



wo 01/57274 



PCT/USOl/00666 



8/10 



'^t CO CM 



E 

2 
O 
CO 



wo 01/57274 



PCT/USOl/00666 



lO/lO 

Fig. lO 




